Doc Ref. AN2 
No - 09/ $&$! 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



ITELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




(51) International Patent Classification 6 : 




(11) International Publication Number: 


WO 97/20918 


C12N 9/00 


Al 


(43) International Publication Date: 


12 June 1997(12.06.97) 



(21) International Application Number: PCT/US96/ 19457 

(22) International Filing Date: 6 December 1996 (06.12.96) 



(30) Priority Data: 

60/008,316 
60/008,317 
60/008,3 1 1 
08/651,568 
08/692,002 



7 December 1995 (07. 1 2.95) US 

7 December 1 995 (07. 1 2.95) US 

7 December 1 995 (07. 1 2.95) US 

22 May 1996 (22.05.96) US 

2 August 1996(02.08.96) US 



(81) Designated States: AU, CA, !L, JP, European patent (AT, BE, 
CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE). 



Published 

With international search report. 



(71) Applicant: RECOMBINANT BIOCATALYSIS, INC. 

[US/US]; 505 Coast Boulevard South, La Jolla, CA 92037 
(US). 

(72) Inventor: SHORT, Jay. M.; 320 Delage Drive, Encinitas, CA 

92024 (US). 

(74) Agents: HERRON, Charles; J. et al.; Carella, Byrne, Bain, 
Gilfillan, Cecchi, Stewart & Olstein, 6 Becker Farm Road, 
Roseland, NJ 07068 (US). 



(54) Title: METHOD OF SCREENING FOR ENZYME ACTIVITY 
(57) Abstract 

Disclosed are processes to identify desired enzymatic activity from a pool of DNA collected from one or more organisms or a DNA 
subjected to random directed mutagenesis. The methods involve the generation of DNA library in a host cell and screening for the desired 
activity. The process can be applied to develop thermally stable proteins having improved enzymatic activity at lower temperature. 



BEST AVAILABLE COPY 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Armenia 


AT 


Austria 


AU 


Australia 


BB 


Barbados 


BE 


Belgium 


BF 


Burkina Faso 


BG 


Bulgaria 


BJ 


Benin 


BR 


Brazil 


BV 


Belarus 


CA 


Canada 


CF 


Central African Republic 


CG 


Congo 


CH 


Switzerland 


CI 


C6te d*l voire 


CM 


Cameroon 


CN 


China 


CS 


Czechoslovakia 


CZ 


Czech Republic 


DE 


Germany 


DK 


Denmark 


EE 


Estonia 


ES 


Spain 


n 


Finland 


FR 


France 


CA 


Gabon 



GB 


United Kingdom 


GE 


Georgia 


GN 


Guinea 


GR 


Greece 


HU 


Hungary 


IE 


Ireland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 


KG 


Kyrgystan 


KP 


Democratic People's Republic 




of Korea 


KR 


Republic of Korea 


KZ 


Kazakhstan 


LI 


Liechtenstein 


LK 


Sri Lanka 


LR 


Liberia 


LT 


Lithuania 


LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco ' 


MD 


Republic of Moldova 


MG 


Madagascar 


MI, 


Mali 


MN 


Mongolia 


MR 


Mauritania 



MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SF, 


Sweden 


SG 


Singapore 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


uz 


Uzbekistan 


VN 


Viet Nam 



WO 97/20918 



PCT/US96/19457 



METHOD OF SCREENING FOR ENZYME ACTIVITY 

This application is a continuation-in-part of U.S. 
application serial no. 08/692,002 filed on August 2, 1996 
(copending) which is a continuation-in-part of U.S. 
provisional application no. 60/008,317 which was filed on 
December 7, 1995 (copending) ; a continuation-in-part of 
U.S. application serial no. 08/651,568 filed on May 22, 
1996 (copending) which is a continuation-in-part of U.S. 
provisional application no. 60/008,316 which was filed on 
December 7, 1995 (copending) ; and a continuation-in-part 
of U.S. provisional application no. 60/008,311 which was 
filed on December 7, 1995 (copending). 

The present invention relates to the production and 
screening of expression libraries for enzyme activity 
and, more particularly, to obtaining selected DNA from 
DNA of a microorganism and to screening of an expression 
library for enzyme activity which is produced from 
selected DNA, to the directed mutagenesis of DNA and 
screening of clones containing the mutagenized DNA for 
resultant specified protein, particularly enzyme, 
activity (ies) of interest, and to thermostable enzymes, 
more particularly, the present invention relates to 
thermostable enzymes which are stable at high temperature 
and which have improved activity at lower temperatures. 
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SUMMARY OF THE INVENTION 

Industry has recognized the need for new enzymes for 
a wide variety of industrial applications. As a result, 
a variety of microorganisms have been screened to 
ascertain whether such microorganisms have a desired 
enzyme activity. If such microorganisms do have the 
desired enzyme activity, the enzyme is then recovered 
from them. The present invention provides a novel 

approach for obtaining enzymes for further use, for 
example, for a wide variety of industrial applications, 
for medical applications, for packaging into kits for use 
as research reagents, etc. 

Thus, in accordance with the present invention, 
recombinant enzymes are generated from microorganisms and 
are classified by various enzyme characteristics. 

More particularly, one aspect of the present 
invention provides a process for identifying clones 
having a specified enzyme activity, which process 
comprises: 

screening for said specified enzyme activity in a 
library of clones prepared by 

(i) selectively isolating target DNA, from DNA 
derived from at least one microorganism, by use of at 
least one probe DNA comprising at least a portion of a 
DNA sequence encoding an enzyme having the specified 
enzyme activity; and 

(ii) transforming a host with isolated target 
DNA to produce a library of clones which are screened, 
preferably for the specified enzyme activity, using an 
activity library screening or nucleic acid library 
screening protocol . 

In a preferred embodiment of this aspect, DNA 
obtained from at least one microorganism is selected by 
recovering from the DNA, DNA which spcifically binds, 
such as by hybridization, to a probe DNA sequence. The 



-2- 

Rl IRSTITi fTP SWPFT mi II F 0R\ 



WO 97/20918 PCT/US96/19457 

DNA obtained from tne microorganism or microorganisms can 
be genomic DNA or genomic gene library DNA. One could 
even use DNA prepared for vector ligation, for instance. 
The probe may be directly or indirectly bound to a solid 
phase by which it is separated from the DNA which is not 
hybridized or otherwise specifically bound to the probe. 
The process can also include releasing DNA from said 
probe after recovering said hybridized or otherwise bound 
DNA and amplifying the DNA so released. 

The invention also provides for screening of the 
expression libraries for gene cluster protein product (s) 
and, more particularly, to obtaining selected gene 
clusters from DNA of a prokaryote or eukaryote. and to 
screening of an expression library for a desired activity 
of a protein of related activity (ies) of a family of 
proteins which results from expression of the selected 
gene cluster DNA of interest. 

More particularly, one embodiment of this aspect 
provides a process for identifying clones having a 
specified protein (s) activity, which process comprises 
screening for said specified enzyme activity in the 
library of clones prepared by (i) selectively isolating 
target gene cluster DNA, from DNA derived from at least 
one organism, by use of at least one probe polynucleotide 
comprising at least a portion of a polynucleotide 
sequence complementary to a DNA sequence encoding the 
protein (s) having the specified activity of interest; and 
(ii) transforming a host with isolated target gene 
cluster DNA to produce a library of such clones which are 
screened for the specified activity of interest. For 
example, if one is using DNA in a lambda vector one could 
package the DNA and infect cells via this route. 

In a particular embodiment of this aspect, gene 
cluster DNA obtained from the genomic DNA of the 
organism (s) is selected by recovering from the DNA, DNA 
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which specifically binds, such as by hybridization, to a 
probe DNA sequence. The polynucleotide probe may be 
directly or indirectly bound to a solid phase by which it 
is separated from the DNA which is not hybridized or 
otherwise specifically bound to the probe. This 
embodiment of this aspect of the process of the invention 
can also include releasing bound DNA from said probe 
after recovering said hybridized or otherwise bound DNA 
and amplifying the DNA so released. 

Another aspect of the invention provides a process 
for obtaining an enzyme having a specified enzyme 
activity derived from a heterogeneous DNA population, 
which process comprises screening, for the specified 
enzyme activity, a library of clones containing DNA from 
the heterogeneous DNA population which have been exposed 
to directed mutagenesis towards production of the 
specified enzyme activity. 

Another aspect of the invention provides a process 
for obtaining an enzyme having a specified enzyme 
activity, which process comprises: screening, for the 
specified enzyme activity, a library of clones containing 
DNA from a pool of DNA populations which have been 
exposed to directed mutagenesis in an attempt to produce 
in the library of clones DNA encoding an enzyme having 
one or more desired characteristics, which can be the 
same or different from the specified enzyme activity. In 
a preferred embodiment, the DNA pool which is subjected 
to directed mutagenesis is a pool of DNA which has been 
selected to encode enzymes having at least one enzyme 
characteristic, in particular at least one common enzyme 
activity. 

Also provided is a process for obtaining a protein 
having a specified activity derived from a heterogeneous 
population of gene clusters by screening, for the 
specified protein activity, a library of clones 
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containing gene clusters from the heterogeneous gene 
cluster population which have been exposed to directed 
mutagenesis towards production of specified protein 
activities of interest. 

Also provided is a process of obtaining a gene 
cluster protein product having a specified activity, by 
screening, for the specified protein activity, a library 
of clones containing gene clusters from a pool of gene 
cluster populations which have been exposed to directed 
mutagenesis to produce in the library of clones gene 
clusters encoding proteins having one or more desired 
characteristics, which can be the same or different from 
the specified protein activity. Preferably, the pool of 
gene clusters which is subjected to directed mutagenesis 
is one which has been selected to encode proteins having 
enzymatic activity in the synthesis of at least one 
therapeutic, prophylactic or physiological regulatory 

activity. 

The process of either of these aspects can further 
comprise, prior to the directed mutagenesis, selectively 
recovering from the heterogeneous population of gene 
clusters, gene clusters which comprise polycistronic 
sequences coding for proteins having at least one common 
physical, chemical or functional characteristic which can 
be the same or different from the activity observed prior 
to directed mutagenesis. Preferably, recovering the gene 
cluster preparation comprises contacting the gene cluster 
population with a specific binding partner, such as a 
solid phase-bound hybridization probe, for at least a 
portion of the gene cluster of interest. The common 
characteristic of the resultant protein (s) can be classes 
of the types of activity specified above, i.e., such as a 
series of enzymes related as parts of a common synthesis 
pathway or proteins capable of hormonal, signal 
transduction or inhibition of metabolic pathways or their 
functions in pathogens and the like. The gene cluster 
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DNA is recovered from clones containing such gene cluster 
DNA from the heterogeneous gene cluster population which 
exhibit the activity of interest. Preferably, the 
directed mutagenesis is site-specific directed 
mutagenesis. This process can further include a step of 
pre -screening the library of clones for an activity, 
which can be the same or different from the specified 
activity of interest, prior to exposing them to directed 
mutagenesis. This activity can result, for example, from 
the expression of a protein or related family of proteins 
of interest . 

The process of any of these aspects can further 
comprise, prior to said directed mutagenesis, selectively 
recovering from the heterogeneous DNA population DNA 
which comprises DNA sequences coding for enzymes having 
at least one common characteristic, which can be the same 
or different from the specified enzyme activity. 
Preferably, recovering the DNA preparation comprises 
contacting the DNA population with a specific binding 
partner, such as a solid phase bound hybridization probe, 
for at least a portion of the coding sequences. The 
common characteristic can be, for example, a class of 
enzyme activity, such as hydrolase activity. DNA is 
recovered from clones containing DNA from the 
heterogeneous DNA population which exhibit the class of 
enzyme activity. Preferably, the directed mutagenesis is 
site- specif ic directed mutagenesis. The process of this 
aspect can further include a step of prescreening the 
library of clones for an activity, which can be the same 
or different from the specified enzyme activity, prior to 
exposing them to directed mutagenesis. This activity can 
result, for example, from the expression of a protein of 
interest . 

The heterogeneous DNA population from which the DNA 
library is derived is a complex mixture of DNA, such as 
is obtained, for example, from an environmental sample. 
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Such samples can contain unculturable, uncultured or 
cultured multiple or single organisms. These 
environmental samples can be obtained from, for example, 
Arctic and Antarctic ice, water or permafrost sources, 
materials of volcanic origin, materials from soil or 
plant sources in tropical areas, etc. A variety of known 
techniques can be applied to enrich the environmental 
sample for organisms of interest, including differential 
culturing, sedimentation gradient, affinity matrices, 
capillary electrophoresis, optical tweezers and 
fluorescence activated cell sorting. The samples can 
also be cultures of a single organism. 

Thermostable enzymes are enzymes that function at 
greater than 60°C. Thermostable enzymes are utilized in 
both industry and biomedical research in assays where 
certain steps of the assay are performed at significantly 
increased temperatures. Thermostable enzymes may be 
obtained from thermophilic organisms found in hot 
springs, volcanic origin, tropical areas etc. Examples 
of such organisms, for instance, include prokaryotic 
microorganisms, such as eubacteria and archaebacteria 
{Bronneomerier , K. and Staudenbauer, W. L., D.R. Woods 
(ed) , the Clostridia and Biotechnology, Butterworth 
Publishers, Stoneham, M.A. (1993), among other organisms. 

Thermostable enzymes exhibit greater storage life 
capacity and organic solvent resistance, as compared to 
their mesophilic counterparts. 

There are applications in industry and in research 
for thermostable enzymes which exhibit enzyme activity at 
a desired minimum temperature. An example of this occurs 
in molecular diagnostics wherein reporter molecules must 
survive long term storage at room temperature or higher 
or they need to function in unusual environments, and the 
assays which employ them are performed at room 



SUBSTITUTE SHEET (RULE 26) 



WO 97/20918 

PCT/US96/19457 

temperature where the activity of thermostable enzymes is 
generally very low. 

Thus, another embodiment of the invention provides a 
process for providing a thermostable enzyme having 
improved enzyme activities at lower temperatures, said 
enzyme being a member selected from the group consisting 
of an enzyme or a polynucleotide encoding said enzyme 
comprising.- 

(a) subjecting to mutagenesis at least one enzyme 
which is stable at a temperature of at least 60»C ; and 

(b) screening mutants produced in (a) for a mutated 
enzyme or polynucleotide encoding a mutated enzyme, which 
mutated enzyme is stable at a temperature of at least 
60°C and which has an enzyme activity at a temperature of 
less than 50°C and which has activity greater than the 
enzyme of step (a) . 

These and other aspects of the present invention 
will be apparent to those skilled in the art from the 
teachings herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A and IB. Figure 1A is a photograph of an 
agarose gel containing standards and samples a-f 
described in Example 2. Samples c-f represent DNA 
recovered from a genomic DNA library using two specific 
DNA probes and amplified using gene specific primers, as 
described in Example 2. Figure IB is also a photograph 
of an agarose gel containing standards and samples a-f 
described in Example 2. Samples c-f represent DNA 
recovered from a genomic DNA library using two specific 
DNA probes and amplified using vector specific primers, 
as described in Example 2. 

Figure 2 is a photograph of four colony 
hybridization plates. Plates A and B showed positive 
clones i.e., colonies which contained DNA prepared in 
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accordance with the present invention, also contained 
probe sequence. Plates C and D were controls and showed 
no positive clones. 

Figure 3 illustrates the full length DNA sequence 
and corresponding deduced amino acid sequence of 
Thermococcus 9N2 Beta-glycosidase . 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The terms "derived" or "isolated" means that 
material is removed from its original environment (e.g., 
the natural environment if it is naturally occurring) . 
For example, a naturally- occurring polynucleotide or 
polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide separated from 
some or all of the coexisting materials in the natural 
system, is isolated. 

The term "error-prone PCR" refers to a process for 
performing PCR under conditions where the copying 
fidelity of the DNA polymerase is low, such that a high 
rate of point mutations is obtained along the entire 
length of the PCR product. Leung, D.W., et al . , 
Technique, 1:11-15 (1989) and Caldwell, R.C. & Joyce 
G.F., PCR Methods Applic, 2:28-33 (1992). 

The term "oligonucleotide directed mutagenesis" 
refers to a process which allows for the generation of 
site -specific mutations in any cloned DNA segment of 
interest. Reidhaar-Olson, J.F. & Sauer, R.T., et a J * , 
Science, 241:53-57 (1988). 

The term "assembly PCR" refers to a process which 
involves the assembly of a PCR product from a mixture of 
small DNA fragments. A large number of different PCR 
reactions occur in parallel in the same vial, with the 
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products of one reaction priming the products of another 
reaction. 



The term "sexual PCR mutagenesis" refers to forced 
homologous recombination between DNA molecules of 
different but highly related DNA sequence in vitro, 
caused by random fragmentation of the DNA molecule based 
on sequence homology, followed by fixation of the 
crossover by primer extension in a PCR reaction. 
Stemmer, W.P., PNAS, USA, 91:10747-10751 (1994). 

The term "in vivo mutagenesis" refers to a process 
of generating random mutations in any cloned DNA of 
interest which involves the propogation of the DNA in a 
strain of E. coli that carries mutations in one or more 
of the DNA repair pathways. These "mutator" strains have 
a higher random mutation rate than that of a wild- type 
parent. Propogating the DNA in one of these strains will 
eventually generate random mut?ati<5hs within the DNA. 

The term "cassette mutagenesis" refers to any 
process for replacing a small region of a double stranded 
DNA molecule with a synthetic oligonucleotide "cassette" 
that differs from the native sequence. The 
oligonucleotide often contains completely and/or 
partially randomized native sequence. 

The term "recursive ensemble mutagenesis" refers to 
an algorithm for protein engineering (protein 
mutagenesis) developed to produce diverse populations of 
phenotypically related mutants whose members differ in 
amino acid sequence. This method uses a feedback 
mechanism to control successive rounds of combinatorial 
cassette mutagenesis. Arkin, A. P. and Youvan, D.C., 

PNAS, USA, 89:7811-7815 (1992). 

The term "exponential ensemble mutagenesis" refers 
to a process for generating combinatorial libraries with 
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a high percentage of unique and functional mutants, 
wherein small groups of residues are randomized in 
parallel to identify, at each altered position, amino 
acids which lead to functional proteins, Delegrave, S. 
and Youvan, D.C., Biotechnology Research, 11:1548-1552 
(1993); and random and site-directed mutagenesis , Arnold, 
F.H., Current Opinion in Biotechnology, 4:450-455 (1993). 
All of the references mentioned above are hereby 
incorporated by reference in their entirety. 

As described with respect to one of the above 
aspects, the invention provides a process for enzyme 
activity screening of clones containing selected DNA 
derived from a microorganism which process comprises: 

screening a library for specified enzyme activity, 
said library including a plurality of clones, said clones 
having been prepared by recovering from DNA of a 
microorganism selected DNA, which DNA is selected by 
hybridization to at least one DNA sequence which is all 
or a portion of a DNA sequence encoding an enzyme having 
the specified activity; and 

transforming a host with the selected DNA to produce 
clones which are screened for the specified enzyme 
activity. 

In one embodiment, a DNA library derived from a 
microorganism is subjected to a selection procedure to 
select therefrom DNA which hybridizes to one or more 
probe DNA sequences which is all or a portion of a DNA 
sequence encoding an enzyme having the specified enzyme 
activity by: 

(a) rendering the double -stranded DNA population 
into a single -stranded DNA population; 

(b) contacting the single-stranded DNA population 
of (a) with the DNA probe bound to a ligand under 
conditions permissive of hybridization so as to produce a 
double -stranded complex of probe and members of the DNA 
population which hybridize thereto; 
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(c) contacting the double-stranded complex of (b) 
with a solid phase specific binding partner for said 
ligand so as to produce a solid phase complex; 

(d) separating the solid phase complex from the 
single-stranded DNA population of (b) ; 

(e) releasing from the probe the members of the 
population which had bound to the solid phase bound 
probe; 

(f ) forming double -stranded DNA from the members of 
the population of (e) ; 

(g) introducing the double -stranded DNA of (f) into 
a suitable host to form a library containing a plurality 
of clones containing the selected DNA; and 

(h) screening the library for the specified enzyme 
activity. 

In another embodiment, a DNA library derived from a 
microorganism is subjected to a selection procedure to 
select therefrom double -stranded DNA which hybridizes to 
one or more probe DNA sequences which is all or a portion 
of a DNA sequence encoding an enzyme having the specified 
enzyme activity by: 

(a) contacting the double -stranded DNA population 
with the DNA probe bound to a ligand under conditions 
permissive of hybridization so as to produce a complex of 
probe and members of the DNA population which hybridize 
thereto; 

(b) contacting the complex of (a) with a solid 
phase specific binding partner for said ligand so as to 
produce a solid phase complex; 

(c) separating the solid phase complex from the 
unbound DNA population of (b) ; 

(d) releasing from the probe the members of the 
population which had bound to the solid phase bound 
probe ; 

(e) introducing the double- stranded DNA of (d) into 
a suitable host to form a library containing a plurality 
of clones containing the selected DNA; and 
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(f) screening the library for the specified enzyme 
activity. 



In another aspect, the process includes a 
preselection to recover DNA including signal or secretion 
sequences. In this manner it is possible to select from 
the DNA population by hybridization as hereinabove 
described only DNA which includes a signal or secretion 
sequence. The following paragraphs describe the protocol 
for this embodiment of the invention, the nature and 
function of secretion signal sequences in general and a 
specific exemplary application of such sequences to an 
assay or selection process. 

Another particularly preferred embodiment of this 
aspect further comprises, after (a) but before (a) above, 
the steps of : 

(i) . contacting the double- stranded DNA population 
of (a) with a ligand-bound oligonucleotide probe that is 
complementary to a secretion signal sequence unique to" a 
given class of proteins under conditions permissive of 
hybridization to form a double -stranded complex; 

(ii) . contacting the complex of (a i) with a solid 
phase specific binding partner for said ligand so as to 
produce a solid phase complex; 

(Hi) separating the solid phase complex from the 
unbound DNA population; 

(iv) releasing the members of the population which 
had bound to said solid phase bound probe; and 

(v) separating the solid phase bound probe from the 
members of the population which had bound thereto. 

The DNA which has been selected and isolated to 
include a signal sequence is then subjected to the 
selection procedure hereinabove described to select and 
isolate therefrom DNA which binds to one or more probe 
DNA sequences derived from DNA encoding an enzyme (s) 
having the specified enzyme activity. 



-13- 

SUBSTITUTE SHEET (RULE 26) 



WO 97/20918 

PCT/US96/194S7 

The pathways by which proteins are sorted and 
transported to their proper cellular location are often 
referred to as protein targeting pathways. One of the 
most important elements in all of these targeting systems 
is a short amino acid sequence at the amino terminus of a 
newly synthesized polypeptide called the signal sequence. 
This signal sequence directs a protein to its appropriate 
location in the cell and is removed during transport or 
when the protein reaches its final destination. Most 
lysosomal, membrane, or secreted proteins have an amino- 
terminal signal sequence that marks them for 
translocation into the lumen of the endoplasmic 
reticulum. More than 100 signal sequences for proteins 
in this group have been determined. The sequences vary 
in length from 13 to 36 amino acid residues. 

A phoA expression vector, termed pMG, which, like 
TaphoA, is useful in identifying genes encoding membrane- 
spanning sequences or signal peptides. Giladi et al . , j 
Bacterid., 175 (13) .- 4129-4136 , 1993. This cloning system 
has been modified to facilitate the distinction of outer 
membrane and periplasmic alkaline phosphatase <AP) fusion 
proteins from inner membrane AP fusion proteins by 
transforming pMG recombinants into E. coli KS330, the 
strain utilized in the "blue halo" assay first described 
by Strauch and Beckwith, Proc. Nat. Acad. Sci. USA, 
85:1576-1580, 1988. The pMG/KS330r cloning and 
screening approach can identify genes encoding proteins 
with clevable signal peptides and therefore can serve as 
a first step in the identification of genes encoding 
polypeptides of interest. 

Another embodiment of the invention provides a 
process for obtaining an enzyme having a specified enzyme 
activity, derived from a heterogeneous DNA population by 
screening, for the specified enzyme activity, a library 
of clones containing DNA from the heterogeneous DNA 
population which have been exposed to directed 



-14- 



WO 97/2091 8 PCT/US96/1 9457 

mutagenesis towards production of the specified enzyme 
activity. The process can further comprise, prior to 
said directed mutagenesis, selectively recovering from 
the heterogeneous DNA population DNA which comprises DNA 
sequences coding for a common characteristic, which can 
be the same or different from the specified enzyme 
activity- The common characteristic can be, for example, 
a class of enzyme activity. This involves recovering DNA 
from clones containing DNA from the heterogeneous DNA 
population which exhibit the class of enzyme activity. 

In this embodiment, recovering the DNA preparation 
preferably is done by contacting the DNA population with 
a specific binding partner, such as a solid phase bound 
hybridization probe, for at least a portion of the coding 
sequences . 

The process of this embodiment can further comprise 
prescreening said library of clones for an activity, 
which can be the same or different from the specified 
enzyme activity, prior to exposing them to directed 
mutagenesis. The prescreening of said clones can be, for 
example, for the expression of a protein of interest. 

Another embodiment of the invention provides a 
process for obtaining an enzyme having a specified enzyme 
activity, which process comprises screening, for the 
specified enzyme activity, a library of clones containing 
DNA from a pool of DNA populations which have been 
exposed to directed mutagenesis, which can be site- 
specific, in an attempt to produce in the library of 
clones DNA encoding an enzyme having one or more desired 
characteristics which can be the same or different from 
the specified enzyme activity. The process of this 
embodiment can further include, prior to said directed 
mutagenesis, selectively recovering from the 
heterogeneous DNA population DNA which comprises DNA 
sequences coding for at least one common enzyme 
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characteristic, which can be the same or different from 
the specified enzyme activity. 

In this embodiment, recovering the DNA preparation 
can comprise contacting the DNA population with a 
specific binding partner for at least a portion of the 
coding sequences . The specific binding partner is a 
solid- phase bound hybridization probe. DNA is recovered 
from clones containing DNA from the heterogeneous DNA 
population which exhibit the class of enzyme activity. 

Applicant has found that it is possible to provide 
thermostable enzymes which have improved activity at 
lower temperatures. More particularly, Applicant has 
found that the activity of thermophilic enzymes can be 
improved at lower temperatures while maintaining the 
temperature stability of such enzymes. Further, it has- 
been found that there can be obtained a thermostable 
enzyme with improved activity at lower temperature by 
subjecting to mutagenesis a thermostable enzyme or 
polynucleotide encoding such thermostable enzyme followed 
by a screening of the resulting mutants to identify a 
mutated enzyme or a mutated polynucleotide encoding a 
mutated enzyme, which mutated enzyme retains 
thermostability and which has an enzyme activity at 
lower temperatures which is at least two (2) times 
greater than a corresponding non-mutated enzyme. 

The thermostable enzymes and mutated thermostable 
enzymes are stable at temperatures up to 60°C and 
preferably are stable at temperatures of up to 70°C and 
more preferably at temperatures up to 95°C and higher. 

Increased activity of mutated thermostable enzymes 
at lower temperatures is meant to encompass activities 
which are at least two-fold, preferably at least four- 
fold, and more preferably at least ten-fold greater than 
that of the corresponding wild-type enzyme. 
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Increased enzyme activity at lower temperatures 
means that enzyme activity is increased at a temperature 
below 50°C, preferably below 40°C and more preferably 
below 30°C. Thus, in comparing enzyme activity at a 
lower temperature between the mutated and non -mutated 
enzyme, the enzyme activity of the mutated enzyme at 
defined lower temperatures is at least 2 times greater 
than the enzyme activity of the corresponding non-mutated 
enzyme . 

Thus, lower temperatures and lower temperature 
ranges include temperatures which are at least 5°C less 
than the temperature at which thermostable enzymes are 
stable, which includes temperatures below 55°C, 50°C, 
45°C, 40°C, 35°C, 30°C, 25°C and 20°C, with below 50°C 
being preferred, and below 40 being more preferred, and . 
below 30° C (or approximately room temperature) being most 
preferred. 

In accordance with this aspect of the invention, the 
lower temperature or lower temperature range at which a 
greater enzyme activity is desired is determined and a 
thermostable enzyme (s), or polynucleotide encoding such 
enzyme (s), are subjected to mutagenesis and the resulting 
mutants are screened to determine mutated enzymes (or 
polynucleotide encoding mutated enzymes) which retain 
thermostability and which have a minimum desired increase 
in enzyme activity at the desired temperature or 
temperature range . 

Thermostable enzymes are enzymes which have 
activity, i.e. are not degraded, at temperatures above 
60°C. Thermostable enzymes also have increased storage 
life, and high resistance to organic solvents. 

Thermostable enzymes may be isolated from 
thermophilic organisms such as those which are found in 
elevated temperatures such as in hot springs, volcanic 
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areas and tropical areas. Examples of thermophilic 
organisms are prokaryotic organisms for example, 
thermophilic bacteria such as eubacteria and 
archaebacteria . 

The DNA from these thermostable organisms can then 
be isolated by available techniques that are described in 
the literature. The IsoQuick* nucleic acid extraction kit 
(MicroProbe Corporation) is suitable for this purpose. 

Alternatively, enzymes not known to have 
thermostable properties can be screened for such 
properties by inserting the DNA encoding the enzyme in an 
expression vector and transforming a suitable host as 
hereinafter described, such that the enzyme may be 
expressed and screened for positive thermostable 
activity. 

The isolated DNA encoding a thermostable enzyme is 
subjected to mutagenesis techniques, with the preferred 
type of mutagenesis techniques being set forth herein/ 

As can be seen from the above-defined mutagenesis 
techniques, the DNA encoding an enzyme having the desired 
activity may be subject to mutagenesis alone, i.e. as 
naked DNA, or the DNA may be subjected to mutagenesis 
after insertion into an appropriate vector as described 
herein. These techniques are referred to as in vitro 
mutagenesis . 

Alternatively, in vivo mutagenesis may be performed 
wherein the DNA is subjected to mutagenesis while it is 
within a cell or living organism. A preferred example of 
this technique utilizes the XL1 Red Strain of E. coli 
(Stratagene, Inc.) which has its DNA repair genes, MutH 
MutL and MutS, deleted such that many different mutations 
occur in a short time. Up to 10,000 mutations may take 
place within a 30 hour time span such that an entire 
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mutated DNA library may be prepared from mutated DNA by 
procedures known in the art. 



After an appropriate amount of time to allow 
mutations to take place, the mutated DNA is excised from 
the host cell in the case of in vivo mutagenesis and 
inserted in another appropriate vector and used to 
transform a non -mutator host, for example, XL1 Blue 
strain of E. coli after which a mutated DNA library is 
prepared- In the case of in vitro mutagenesis, if the 
mutated DNA has previously been inserted in an 
appropriate expression vector, said vector is then used 
directly to transform an appropriate non-mutator host for 
the preparation of a mutated DNA library, if the 
mutagenized DNA is not in an appropriate expression 
vector. 

A library is prepared for screening by transforming 
a suitable organism. Hosts, particularly those 
specifically identified herein as preferred, are 
transformed by artificial introduction of the vectors 
containing the mutated DNA by inoculation under 
conditions conducive for such transformation. 

The resultant libraries of transformed clones are 
then screened for clones which display activity for the 
enzyme of interest in a phenotypic assay for enzyme 
activity . 

For example, having prepared a multiplicity of 
clones from DNA mutagenized by one of the techniques 
described above, such clones are screened for the 
specific enzyme activity of interest. 

For example, the clones containing the mutated DNA 
are now subject to screening procedures to determine 
their activity within both higher temperatures and within 
the desired lower temperature range to identify mutants 
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which have the desired increase in activity within the 
lower temperature range when compared to the 

corresponding wild-type thermostable enzyme which is non 
mutated. 



Positively identified clones, i.e. those which 
contain mutated DNA sequences which express thermostable 
enzymes which are thermostable and yet have. an increased 
activity at least two times than the corresponding wild- 
type enzyme at temperatures within the lower temperature 
range, are isolated and sequenced to identify the DNA 
sequence. As an example, phosphatase activity at the 
desired lower temperature ranges may be identified by 
exposing the clones, and thus the thermostable enzyme and 
testing its ability to cleave an appropriate substrate. 

In Example 7 phosphatase and 0-galactosidase 
activity are measured by comparing the wild-type enzymes 
to the enzymes subjected to mutagenesis. As can be seen 
from the results reported there, mutagenesis of a wild- 
type phosphatase and /3-galactosidase thermophilic enzyme 
produce mutated enzymes which were 3 and 2.5 times more 
active, respectively, at lower temperatures than the 
corresponding wild-type enzymes within the lower 
temperature range of room temperature. 

In the case of protein engineering, after subjecting 
a thermophilic enzyme to mutagenesis, the mutagenized 
enzyme is screened for the desired activity namely, 
increased activity at lower temperatures while 
maintaining activity at the higher temperatures. Any of 
the known techniques for protein mutagenesis may be 
employed, with particularly preferred mutagenesis 
techniques being those discussed above. 



The DNA derived from a microorganism (s) is 
preferably inserted into an appropriate vector (generally 
a vector containing suitable regulatory sequences for 
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effecting expression) prior to subjecting such DNA to a 
selection procedure to select and isolate therefrom DNA 
which hybridizes to DNA derived from DNA encoding an 
enzyme (s) having the specified enzyme activity. 

The microorganisms from which the libraries may be 
prepared include prokaryotic microorganisms, such as 
Eubacteria and Archaebacteria, and lower eukaryotic 
microorganisms such as fungi, some algae and protozoa. 
The microorganisms may be cultured microorganisms or 
uncultured microorganisms obtained from environmental 
samples and such microorganisms may be extremophiles , 
such as thermophiles, hyperthermophiles, psychrophiles, 
psychrotrophs , etc . 

As indicated above, the library may be produced from 
environmental samples in which case DNA may be recovered- 
without culturing of an organism or the DNA may be 
recovered from a cultured organism. 

Sources of microorganism DNA as a starting material 
library from which target DNA is obtained are 
particularly contemplated to include environmental 
samples, such as microbial samples obtained from Arctic 
and Antarctic ice, water or permafrost sources, materials 
of volcanic origin, materials from soil or plant sources 
in tropical areas, etc. Thus, for example, DNA may be 
recovered from either a culturable or non-culturable 
organism and employed to produce an appropriate 
recombinant expression library for subsequent 
determination of enzyme activity. 

Bacteria and many eukaryotes have a coordinated 
mechanism for regulating genes whose products are 
involved in related processes. The genes are clustered, 
in structures referred to as "gene clusters," on a single 
chromosome and are transcribed together under the control 
of a single regulatory sequence, including a single 
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promoter which initiates transcription of the entire 
cluster. The gene cluster, the promoter/ and additional 
sequences that function in regulation altogether are 
referred to as an "operon" and can include up to 20 or 
more genes, usually from 2 to 6 genes. Thus, a gene 
cluster is a group of adjacent genes that are either 
identical or related, usually as to their function. 

Some gene families consist of identical members. 
Clustering is a prerequisite for maintaining identity 
between genes, although clustered genes are not 
necessarily identical. Gene clusters range from extremes 
where a duplication is generated to adjacent related 
genes to cases where hundreds of identical genes lie in a 
tandem array. Sometimes no significance is discernable 
in a repetition of a particular gene. A principal 
example of this is the expressed duplicate insulin genes 
in some species, whereas a single insulin gene is 
adequate in other mammalian species. 

It is important to further research gene clusters 
and the extent to which the full length of the cluster is 
necessary for the expression of the proteins resulting 
therefrom. Further, gene clusters undergo continual 
reorganization and, thus, the ability to create 
heterogeneous libraries of gene clusters from, for 
example, bacterial or other prokaryote sources is 
valuable in determining sources of novel proteins, 
particularly including enzymes such as, for example, the 
polyketide synthases that are responsible for the 
synthesis of polyketides having a vast array of useful 
activities. Other types of proteins that are the 
product (s) of gene clusters are also contemplated, 
including, for example, antibiotics, antivirals, 
antitumor agents and regulatory proteins, such as 
insulin. 
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Polyketides are molecules which are an extremely 
rich source of "bioactivities, including antibiotics (such 
as tetracyclines and erythromycin) , anti- cancer agents 

(daunomycin) , immunosuppressants (FK506 and rapamycin) , 
and veterinary products (monensin) . Many polyketides 

(produced by polyketide synthases) are valuable as 
therapeutic agents . Polyketide synthases are 
multifunctional enzymes that catalyze the biosynthesis of 
a hugh variety of carbon chains differing in length and 
patterns of functionality and cyclization. Polyketide 
synthase genes fall into gene clusters and at least one 
type (designated type I) of polyketide synthases have 
large size genes and enzymes, complicating genetic 
manipulation and in vitro studies of these 
genes/proteins . 

The ability to select and combine desired components 
from a library of polyketides and postpolyketide 
biosynthesis genes for generation of novel polyketides 
for study is appealing. The method (s) of the present 
invention make it possible to and facilitate the cloning 
of novel polyketide synthases, since one can generate 
gene banks with clones containing large inserts 
(especially when using the f -factor based vectors) , which 
facilitates cloning of gene clusters. 

Preferably, the gene cluster DNA is ligated into a 
vector, particularly wherein a vector further comprises 
expression regulatory sequences which can control and 
regulate the production of a detectable protein or 
protein- related array activity from the ligated gene 
clusters . Use of vectors which have an exceptionally 
large capacity for exogenous DNA introduction are 
particularly appropriate for use with such gene clusters 
and are described by way of example herein to include the 
f -factor (or fertility factor) of E. coli. This f -factor 
of E. coli is a plasmid which affect high-frequency 
transfer of itself during conjugation and is ideal to 
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achieve and stably propagate large DNA fragments, such as 
gene clusters from mixed microbial samples. 

The DNA can then be isolated by available techniques 
that are described in the literature. The IsoQuick* 
nucleic acid extraction kit (MicroProbe Corporation) is 
suitable for this purpose. 

The DNA isolated or derived from these 
microorganisms can preferably be inserted into a vector 
or a plasmid prior to probing for selected DNA. Such 
vectors or plasmids are preferably those containing 
expression regulatory sequences, including promoters, 
enhancers and the like. Such polynucleotides can be part 
of a vector and/or a composition and still be isolated, 
in that such vector or composition is not part of its 
natural environment . Particularly preferred phage or 
plasmid and methods for introduction and packaging into 
them are described in detail in the protocol set forth 
herein. 

The following outlines a general procedure for 
producing libraries from both culturable and non- 
culturable organisms, which libraries can be probed to 
select therefrom DNA sequences which hybridize to 
specified probe DNA: 

ENVIRONMENTAL SAMPLE 

Obtain Biomass 

DNA Isolation (various methods) 

Shear DNA (25 gauge needle) 

Blunt DNA (Mung Bean Nuclease) 

Methylate DNA (EcoR I Methylase) 

Ligate to EcoR I linkers (GGAATTCC) 

Cut back linkers (BcoR I Restriction Endonuclease) 

Size Fractionate (Sucrose Gradient) 

Ligate to lambda vector (lambda Zap II and gtll) 

Package {in vitro lambda packaging extract) 
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Clones having an enzyme activity of interest are 
identified by screening. This screening can be done 
either by hybridization, to identify the presence of DNA 
coding for the enzyme of interest or by detection of the 
enzymatic activity qf interest. 

The probe DNA used for selectively isolating the 
target DNA of interest from the DNA derived from at least 
one microorganism can be a full-length coding region 
sequence or a partial coding region sequence of DNA for 
an enzyme of known activity. The original DNA library 
can be preferably probed using mixtures of probes 
comprising at least a portion of DNA sequences encoding 
enzymes having the specified enzyme activity. These 
probes or probe libraries are preferably single -stranded 
and the microbial DNA which is probed has preferably been 
converted into single- stranded form. The probes that are 
particularly suitable are those derived from DNA encoding 
enzymes having an activity similar or identical to the 
specified enzyme activity which is to be screened. 

The probe DNA should be at least about 10 bases and 
preferably at least 15 bases. In one embodiment, the 
entire coding region may be employed as a probe. 
Conditions for the hybridization in which target DNA is 
selectively isolated by the use of at least one DNA probe 
will be designed to provide a hybridization stringency of 
at least about 50% sequence identity, more particularly a 
stringency providing for a sequence identity of at least 
about 70%, preferably 75%. 

Hybridization techniques for probing a microbial DNA 
library to isolate target DNA of potential interest are 
well known in the art and any of those which are 
described in the literature are suitable for use herein, 
particularly those which use a solid phase-bound, 
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directly or indirectly bound, to probe DNA for separation 
from the remainder of the DNA derived from the 
microorganisms. Solution phase hybridizations followed 
by binding of the probe to a solid phase is preferable. 

Preferably the probe DNA is "labeled" with one 
partner of a specific binding pair (i.e. a ligand) and 
the other partner of the pair is bound to a solid matrix 
to provide ease of separation of target from its source. 
The ligand and specific binding partner can be selected 
from, in either orientation, the following.- (l) an 
antigen or hapten and an antibody or specific binding 
fragment thereof; (2) biotin or iminobiotin and avidin 
or streptavidin; (3) a sugar and a lectin specific 
therefor; (4) an enzyme and an inhibitor therefor,- (5) 
an apoenzyme and cofactor; (6) complementary 
homopolymeric oligonucleotides,- and (7) a hormone and a 
receptor therefor. The solid phase is preferably 
selected from: (l) a glass or polymeric surface; (2) a 
packed column of polymeric beads; and (3) magnetic or 
paramagnetic particles. 

The library of clones prepared as described above 
can be screened directly for enzymatic activity without 
the need for culture expansion, amplification or other 
supplementary procedures. However, in one preferred 
embodiment, it is considered desirable to amplify the DNA 
recovered from the individual clones such as by PCR. 

Further, it is optional but desirable to perform an 
amplification of the target DNA that has been isolated. 
In this embodiment the target DNA is separated from the 
probe DNA after isolation. It is then amplified before 
being used to transform hosts. The double stranded DNA 
selected to include as at least a portion thereof a 
predetermined DNA sequence can be rendered single 
stranded, subjected to amplification and reannealed to 
provide amplified numbers of selected double stranded 
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DNA. Numerous amplification methodologies are now well 
known in the art. 



The selected DNA is then used for preparing a 
library for screening by transforming a suitable 
organism. Hosts, particularly those specifically 
identified herein as preferred, are transformed by 
artificial introduction of the vectors containing the 
target DNA by inoculation under conditions conducive for 
such transformation. One could transform with double 
stranded circular or linear DNA or there may also be 
instances where one would transform with single stranded 
circular or linear DNA. 

The resultant libraries of transformed clones are 
then screened for clones which display activity for the 
enzyme of interest in a phenotypic assay for enzyme 
activity . 

Having prepared a multiplicity of clones from DNA " 
selectively isolated from an organism, such clones are 
screened for a specific enzyme activity and to identify 
the clones having the specified enzyme characteristics. 

The screening for enzyme activity may be effected on 
individual expression clones or may be initially effected 
on a mixture of expression clones to ascertain whether or 
not the mixture has one or more specified enzyme 
activities. If the mixture has a specified enzyme 
activity, then the individual clones may be rescreened 
for such enzyme activity or for a more specific activity. 
Thus, for example, if a clone mixture has hydrolase 
activity, then the individual clones may be recovered and 
screened to determine which of such clones has hydrolase 
activity. 

As representative examples of expression vectors 
which may be used there may be mentioned viral particles, 
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baculovirus, phage, plasmids, phagemids, cosmids, 
phosmids, bacterial artificial chromosomes, viral DNA 
(e.g. vaccinia, adenovirus, foul pox virus, pseudorabies 
and derivatives of SV40) , Pl-based artificial 
chromosomes, yeast plasmids, yeast artificial 
chromosomes, and any other vectors specific for specific 
hosts of interest (such as bacillus, aspergili us , yeast 
etc.) Thus, for example, the DNA may be included in any 
one of a variety of expression vectors for expressing a 
polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences. Large 
numbers of suitable vectors are known to those of skill 
in the art, and are commercially available. The 
following vectors are provided by way of example,- 
Bacterial: pQE70, pQE60, pQ E -9 (Qiagen), psiX174,' 
pBluescript SK, pBluescript KS (Stratagene); pTRC99a, 
PKK223-3, pKK233-3, pDR540, pRIT2T (Pharmacia); 
Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXTl, pSG ' 
(Stratagene), pSVK3, pBPV, pMSG, pSVLSV40 (Pharmacia) 
However, any other plasmid or vector may be used as long 
as they are replicable and viable in the host. 

A particularly preferred type of vector for use in 
the present invention contains an f -factor origin 
replication. The f-factor (or fertility factor) in B. 
coli is a plasmid which effects high frequency transfer 
of itself during conjugation and less frequent transfer 
of the bacterial chromosome itself. A particularly 
preferred embodiment is to use cloning vectors , referred 
to as "fosmids" or bacterial artificial chromosome (BAC) 
vectors. These are derived from E. coli f-factor which 
is able to stably" integrate large segments of DNA. When 
integrated with DNA from a mixed uncultured environmental 
sample, this makes it possible to achieve large genomic 
fragments in the form of a stable "environmental DNA 
library. " 
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The DNA derived from a microorganism (s) may be 
inserted into the vector by a variety of procedures. In 
general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures known in 
the art. Such procedures and others are deemed to be 
within the scope of those skilled in the art. 

The DNA sequence in the expression vector is 
operatively linked to an appropriate expression control 
sequence (s) (promoter) to direct mRNA synthesis. 
Particular named bacterial promoters include lad, lacZ, 
T3, T7, gpt, lambda P R , P L and trp. Eukaryotic promoters 
include CMV immediate early, HSV thymidine kinase, early 
and late SV40, LTRs from. retrovirus, and mouse 
metallothionein-I . Selection of the appropriate vector 
and promoter is well within the level of ordinary skill 
in the art. The expression vector also contains a 
ribosome binding site for translation initiation and a 
transcription terminator. The vector may also include 
appropriate sequences for amplifying expression. 
Promoter regions can be selected from any desired gene 
using CAT (chloramphenicol transferase) vectors or other 
vectors with selectable markers. 

In addition, the expression vectors preferably 
contain one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells 
such as dihydrof olate reductase or neomycin resistance 
for eukaryotic cell culture, or such as tetracycline or 
ampicillin resistance in E. coli. 

Generally, recombinant expression vectors will 
include origins of replication and selectable markers 
permitting transformation of the host cell, e.g., the 
ampicillin resistance gene of E. coli and S. cerevisiae 
TRP1 gene, and a promoter derived from a highly -expressed 
gene to direct transcription of a downstream structural 
sequence. Such promoters can be derived from operons 
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encoding glycolytic enzymes such as 3 -phosphoglycerate 
kinase (PGK) , a-factor, acid phosphatase, or heat shock 
proteins, among others. The heterologous structural 
sequence is assembled in appropriate phase with 
translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic 
space or extracellular medium. 

The DNA selected and isolated as hereinabove 
described is introduced into a suitable host to prepare i 
library which is screened for the desired enzyme 
activity. The selected DNA is preferably already in a 
vector which includes appropriate control sequences 
whereby selected DNA which encodes for an enzyme may be 
expressed, for detection of the desired activity. The 
host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a 
yeast cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the construct 
into the host cell can be effected by calcium phosphate 
transfection, DEAE-Dextran mediated transf ection, or 
electroporation (Davis, L . , Dibner, M. , Battey, I., Basic 
Methods in Molecular Biology, (1986) ) 

As representative examples of appropriate hosts, 
there may be mentioned: bacterial cells, such as E. coli, 
Streptomyces, Salmonella typhimurium; fungal cells, such 
as yeast; insect cells such as Drosophila S2 and 
Spodoptera Sf9 ; animal cells such as CHO, COS or Bowes 
melanoma; adenoviruses; plant cells, etc. The selection 
of an appropriate host is deemed to be within the scope 
of those skilled in the art from the teachings herein. 

With particular references to various mammalian cell 
culture systems that can be employed to express 
recombinant protein, examples of mammalian expression 
systems include the COS -7 lines of monkey kidney 
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fibroblasts, described by Gluzman f Cell, 23:175 (1981), 
and other cell lines capable of expressing a compatible 
vector, for example, the C127, 3T3, CHO, HeLa and BHK 
cell lines. Mammalian expression vectors will comprise 
an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the 
SV4 0 splice, and polyadenylation sites may be used to 
provide the required nontranscribed genetic elements. 

Host cells are genetically engineered (transduced or 
transformed or transfected) with the vectors. The 
engineered host cells can be cultured in conventional 
nutrient media modified as appropriate -for activating 
promoters, selecting transf ormants or amplifying genes. 
The culture conditions, such as temperature, pH and the 
like, are those previously used with the host cell 
selected for expression, and will be apparent to the 
ordinarily skilled artisan. 

The library may be screened for a specified enzyme 
activity by procedures known in the art. For example, 
the enzyme activity may be screened for one or more of 
the six IUB classes; oxidoreductases, transferases, 
hydrolases, lyases, isomerases and ligases. The 
recombinant enzymes which are determined to be positive 
for one or more of the IUB classes may then be rescreened 
for a more specific enzyme activity. 

Alternatively, the library may be screened for a 
more specialized enzyme activity. For example, instead 
of generically screening for hydrolase activity, the 
library may be screened for a more specialized activity, 
i.e. the type of bond on which the hydrolase acts. Thus, 
for example, the library may be screened to ascertain 
those hydrolases which act on one or more specified 
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chemical functionalities, such as: (a) amide (peptide 
bonds), i.e. proteases; (b) ester bonds, i.e. esterases 
and lipases; (c) acetals, i.e., 9 lycosidases etc. 

The clones which are identified as having the 
specified enzyme activity may then be sequenced to 
identify the DNA sequence encoding an enzyme having the 
specified activity.. Thus, in accordance with the present 
invention it is possible to isolate and identify.- (i) DNA 
encodmg an enzyme having a specified enzyme activity 
Ui) enzymes having such activity (including the amino 
acid sequence thereof) and (iii) produce recombinant 
enzymes having such activity. 

Alternatively, clones found to have the enzymatic 
activity for which the screen was performed are sequenced 
and then subjected to directed mutagenesis to develop ne W 
enzymes with desired activities or to develop modified 
enzymes with particularly desired properties that are 
absent or less pronounced in the wild-type enzyme, such 
as stability to heat or organic solvents. Any of the 
known techniques for directed mutagenesis are applicable 
to the invention. For example, particularly preferred 
mutagenesis techniques for use in accordance with the 
invention include those discussed below. 

The present invention may be employed for example 
to identify or produce new enzymes having, for example,' 
the following activities which may be employed for the 

following uses: 

1 Lipase/Esterase 

a. Enantioselective hydrolysis of esters (lipids)/ 
thioesters 

1) Resolution of racemic mixtures 

2) Synthesis of optically active acids or 
alcohols from jneso-di esters 
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b. Selective syntheses 

1) Regiospecif ic hydrolysis of carbohydrate 

esters 

2) Selective hydrolysis of cyclic secondary 
alcohols 

c. Synthesis of optically active esters, lactones, 
acids, alcohols 

1) Transesterif ication of 
activated/nonactivated esters 

2) Interesterif ication 

3) Optically active lactones from hydroxyesters 

4) Regio- and enantioselective ring opening 
of anhydrides 

d. Detergents 

e. Fat/Oil conversion 

f . Cheese ripening 

2 Protease 

a. Ester/amide synthesis 

b. Peptide synthesis 

c. Resolution of racemic mixtures of amino acid 
esters 

d. Synthesis of non-natural amino acids 

e. Detergents/protein hydrolysis 

3 Glycosidase/Glycosyl transferase 

a. Sugar/polymer synthesis 

b. Cleavage of glycosidic linkages to form mono, 
di-and oligosaccharides 

c- Synthesis of complex oligosaccharides 

d. Glycoside synthesis using UDP-galactosyl 
transferase 

e. Transglycosylation of disaccharides, glycosyl 
fluorides, aryl galactosides 

f . Glycosyl transfer in oligosaccharide synthesis 

g. Diastereoselective cleavage of (3- 
glucosylsulf oxides 

h. Asymmetric glycosylations 

-33- . 
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i • Food processing 
j . Paper processing 

4 Phosphatase/Kinase 

a. Synthesis/hydrolysis of phosphate esters 

1) Regio-, enantioselective phosphorylation 

2) Introduction of phosphate esters 

3) Synthesize phospholipid precursors 

4) Controlled polynucleotide synthesis 

b. Activate biological molecule 

c. Selective phosphate bond formation without 
protecting groups 

5 Mono/Dioxygenas e 

a. Direct oxyf unctionalization of unactivated 
organic substrates 

b. Hydroxylation of alkane, aromatics, steroids 

c. Epoxidation of alkenes 

d. Enantioselective sulphoxidation 

e. Regio- and stereoselective Bayer-villi ger 
oxidations 

6 Haloperoxidase 

a. Oxidative addition of halide ion to 
nucleophilic sites 

b. Addition of hypohalous acids to olefinic bonds 
c Ring cleavage of cyclopropanes 

d. Activated aromatic substrates converted to 
ortho and para derivatives 

e. 1.3 diketones converted to 2 -halo-derivatives 
f - Heteroatom oxidation of sulfur and nitrogen 

containing substrates 
g. Oxidation of enol acetates, alkynes and 
activated aromatic rings 

7 Ligain peroxidase/Diarylpropane peroxidase 

a. Oxidative cleavage of C-C bonds 

b. Oxidation of benzylic alcohols to aldehydes 
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c. Hydroxylation of benzylic carbons 

d. Phenol dimerization 

e. Hydroxylation of double bonds to form diols 

f . Cleavage of lignin aldehydes 

8 Epoxide hydrolase 

a. Synthesis of enantiomerically pure bioactive 
compounds 

b. * Regio- and enantioselective hydrolysis of 

epoxide 

c . Aromatic and olef inic epoxidation by 
monooxygenases to form epoxides 

d. Resolution of racemic epoxides 

e. Hydrolysis of steroid epoxides 
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Nitrile hydratase/nitrilase 

a. Hydrolysis of aliphatic nitriles to 
c ar boxami de s 

b. Hydrolysis of aromatic, heterocyclic, 
unsaturated aliphatic nitriles to corresponding 
acids 

c . Hydrolysis of acrylonitrile 

d. Production of aromatic and carboxamides , 
carboxylic acids (nicotinamide, picolinamide, 
isonicotinamide) 

e. Regioselective hydrolysis of acrylic dinitrile 

f . a-amino acids from a-hydroxynitriles 



a. Transfer of amino groups into oxo-acids 



11 Amidase/Acylase 

a. Hydrolysis of amides, amidines, and other ON 
bonds 

b. Non-natural amino acid resolution and synthesis 



The following examples illustrate the invention but 
are not a limitation of its scope. 



-35- 

SUBSTITUTE SHEET (RULE 26) 



WO 97/20918 



PCT/US96/I9457 



Example 1 

Preparation of a math^i •; - n DNA r.jh-rar^, 

The following outlines the procedures used to 
generate a gene library from a sample of the exterior 
surface of a whale bone found at 124 0 meters depth in the 
Santa Catalina Basin during a div 2 e^edition. 

Isolate DNA. 

IsoQuick Procedure as per manufacturer's 
instructions. 



Shear DNA 



Vigorously push and pull DNA through a 25G 
double-hub needle and 1-cc syringes about 
500 times. 

Check a small amount (0.5 / ( g) on a 0.8% " 
agarose gel to make sure the majority of 
the DNA is in the desired size range 
(about 3-6 kb) . 



Blunt DNA 



(150 u/al) 



5 
6 

•-» 
/ 

S 
9 



Add 



and dry. 



h j° to a final volume of 405 /il 

45 Ml 10X Mung Bean Buffer 

2 '° M 1 Mung Bean Nuclease 

Incubate 37°c, 15 minutes. 
Phenol /chloroform extract once. 
Chloroform extract once. 

Add l ml ice cold ethanol to precipitate. 

Place on ice for 10 minutes. 

Spin in microfuge, high speed, 30 minutes. 

Wash with l ml 70% ethanol. 

Spin in microfuge, high speed, 10 minutes 
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Methylate DNA 

1. Gently resuspend DNA in 26 ^1 TE. 

2. Add: 

4.0 /il 10X EcoR I Methylase 



Buffer 



0.5 /il SAM (32 mM) . 

5.0 pi EcoR I Methylase (40 



u//xl) 



3. Incubate 37% 1 hour. 

Insure Blunt Ends 

1. Add to the methylation reaction: 

5. 0 /il 100 mM MgCl 2 

8.0 /il dNTP mix (2.5 raM of 

each dGTP, dATP, dTTP, 

dCTP) 

4.0 ^1 Klenow (5 u//il) 

2. Incubate 12°C, 30 minutes. 
3 . Add 450 fxl IX STE. 

4. Phenol /chloroform extract once. 

5. Chloroform extract once. 

6. Add 1 ml ice cold ethanpl to precipitate 
and place on ice for 10 minutes. 

7. Spin in microfuge, high speed, 3 0 minutes. 

8. Wash with' 1 ml 70% ethanol. 

9. Spin in microfuge, high speed, 10 minutes 
and dry . 

Linker Ligation 

1. Gently resuspend DNA in 7 fil Tris-EDTA 

(TE) . 

2 . Add : 

14 fil Phosphorylated EcoR I 

linkers (200 ng/^D 

3.0 /il 10X Ligation Buffer 

3.0 fil 10 mM rATP 

3.0 fil T4 DNA Ligase (4Wu/^l) 

3. Incubate 4°C, overnight. 
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EcdRl Cutback 

1. Heat kill ligation reaction 66°C, 10 

minutes. 

2 . Add : 

237.9 nl HoO 

30 f 11 10X EcoR I Buffer 

21 ^ ScoR I Restriction 

Enzyme (100 u/fil) 

3. incubate 37°c, 1.5 hours. 

4. Add 1.5 pi 0.5 M EDTA. 

5. Place on ice. 

Sucrose Gradient (2.2 ml) size Fractionation 

1. Heat sample to 65°c, 10 minutes. 

2. Gently load on 2.2 ml sucrose gradient. 

3. Spin in mini-ultracentrifuge, 45K, 20°C, 4 
hours (no brake) . 

4. Collect fractions by puncturing the bottom 
of the gradient tube with a 20G needle and 
allowing the sucrose to flow through the 
needle. Collect the first 2 0 drops in a 
Falcon 2059 tube then collect 10 1-drop 
fractions (labelled 1-10) . Each drop is 
about 60 (il in volume. 

5. Run 5 /il of each fraction on a 0.8% 
agarose gel to check the size. 

6. Pool fractions 1-4 (about 10-1.5 kb) and, 
in a separate tube, pool fractions 5-7 
(about 5-0.5 kb) . 

7. Add 1 ml ice cold ethanol to precipitate 
and place on ice for 10 minutes. 

8. Spin in microfuge, high speed, 30 minutes. 

9. Wash with l ml 70% ethanol. 

10. Spin in microfuge, high speed, 10 minutes 
and dry. 

11. Resuspend each in 10 /il TE buffer. 
Test Ligation to Lambda Arms 
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1 . Plate assay to get an approximate 
concentration. Spot 0.5 /il of the sample 
on agarose containing ethidium bromide 
along with standards {DNA samples of known 
concentration) . View in UV light and 
estimate concentration compared to the 
standards. Fraction 1-4 = >1.0 /zg/jzl . 
Fraction 5-7 = 500 ng/^1 . 

2. Prepare the following ligation reactions 
(5 /il reactions) and incubate 4°C, 
overnight : 











Lambda 










10X 




arms 




T4 DNA 






Ligase 


lOmM 


(gtll 


Insert 


Ligase 


Sample 


H 2 0 


Buffer 


rATP 


and 


DNA 


(4 










ZAP) 




Wu//i) 


Fraction 


0.5 


0.5/il 


0.5 fil 


1.0 /il 


2.0 fil 


0.5 /il 


1-4 


Pi 












Fraction 


0.5 


0.5 /il 


0.5/il 


1.0 /il 


2.0 /il 


0.5 fil 


5-7 


Ml 













Test Package and Plate 

1. Package the ligation reactions following 
manufacturer's protocol. Package 2.5 /il 
per packaging extract (2 extracts per 
ligation) . 

2 . Stop packaging reactions with 500 fil SM 
buffer and pool packaging that came from 
the same ligation. 

3. Titer 1.0 ill of each on appropriate host 
(OD 600 = 1. 0) [XLI-Blue MRF for ZAP and 
Y1088 for gtll] 

Add 200 /il host (in mM MgSOJ to 

Falcon 205 9 tubes 

Inoculate with 1 fil packaged phage 
Incubate 37°C, 15 minutes 
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Add about 3 ml 48 °C top agar 

[50 ml stock containing 150 /il 
IPTG (0.5M) and 300 /il X-GAL 
(350 mg/ml)] 

Plate on 100mm plates and incubate 

37°C, overnight. 
4. Efficiency results: 

9tll: 1.7 x 10 4 recombinants 

with 95% background 

ZAP II: 4.2 x 10 4 recombinants with 

66% background 

Contaminants in the DNA sample may have inhibited 
the enzymatic reactions, though the sucrose gradient 
and organic extractions may have removed them. 
Since the DNA sample was precious, an effort was 
made to "fix" the ends for cloning: 

Re -Blunt DNA 

1. Pool all left over DNA that was not 
ligated to the lambda arms (Fractions 1-7) 
and add H 2 0 to a final volume of 12 /il . 
Then add: 

143 /il H 2 0 

20 fil iox Buffer 2 (from 

Stratagene's cDNA 
Synthesis Kit) 

23 Ml Blunting dNTP (from 

Stratagene's cDNA 
Synthesis Kit) 

20 Ml Pfu (from Stratagene"s 

cDNA Synthesis Kit) 

2. Incubate 72° C, 30 minutes. 

3. Phenol /chloroform extract once. 

4 . Chloroform extract once . 

5. Add 20 fih 3M NaOAc and 400 /il ice cold 
ethanol to precipitate. 

6. Place at -20°C, overnight. 

7. Spin in microfuge, high speed, 30 minutes. 
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8. Wash with 1 ml 70% ethanol. 

9. Spin in microfuge, high speed, 10 minutes 

and dry. 

(Do NOT Methylate DNA since it was already methylated in 
the first round of processing) 

Adaptor Ligation 

1. Gently resuspend DNA in 8 /xl EcoR I 
adaptors (from Stratagene's cDNA Synthesis 
Kit) . 

2. Add: 

1.0 fil 10X Ligation Buffer 

1. 0 fil 10 mM rATP 

1.0 /zl T4 DNA Ligase (4Wu/pl) 

3. Incubate 4°C, 2 days. 

(Do NOT cutback since using ADAPTORS this time. Instead, 
need to phosphorylate) 

Phosphorylate Adaptors 

I. Heat kill ligation reaction 70°C, 30 

minutes . 

Add: 

1.0 /xl 10X Ligation Buffer 

2 . 0 (il lOmM rATF 

6.0 /il H 2 0 

1.0 /xl PNK (from Stratagene's 

cDNA Synthesis Kit) . 

3. Incubate 37°C, 30 minutes. 

4. Add 31 Ml H 2 0 and 5 pi 10X STE. 

5. Size fractionate on a Sephacryl S-500 spin 
column (pool fractions 1-3) . 

6. Phenol/chloroform extract once. 

7. Chloroform extract once. 

8. Add ice cold ethanol to precipitate. 

9. Place on ice, 10 minutes. 

10. Spin in microfuge, high speed, 30 minutes. 

II. Wash with 1 ml 70% ethanol. 
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12. Spin in microfuge, high speed, 10 minutes 

and dry. 

13. Resuspend in 10.5 fil TE buffer. 

Do not plate assay. Instead, ligate directly to arms as 
above except use 2.5 ftl of DNA and no water. 

Package and titer as above. 

Efficiency results: 

S rtll: 2.5 x 10 6 recombinants with 

2.5% background 

ZAP II: 9.6 X 10 5 recombinants with o 



background 



% 



Amplification of Libraries (5.0 x 10 s recombinants 
from each library) 

1. Add 3.0 ml host cells (OD 600 =1. 0 ) to two 50 
ml conical tube. 

2. inoculate with 2.5 X 10 s pfu per conical 

tube . 

3. Incubate 37°C, 20 minutes. 

4. Add top agar to each tube to a final 
volume of 45 ml . 

5 . Plate the tube across five 150 mm plates . 

6. Incubate 37°C, 6-8 hours or until plaques 
are about pin-head in size. 

7. Overlay with 8-10 ml SM Buffer and place 
at 4°C overnight (with gentle rocking if 
possible) . 

Harvest Phage 

1. Recover phage suspension by pouring the SM 
buffer off each plate into a 50 -ml conical 
tube. 

2. Add 3 ml chloroform, shake vigorously and 
incubate at room temperature, 15 minutes. 

3. Centrifuge at 2K rpm, 10 minutes to remove 
cell debris. 
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4. Pour supernatant into a sterile flask, add 
500 jil chloroform. 

5. Store at 4°C. 



Titer Amplified Library 

1. Make serial dilutions: 

10* 5 = 1 /il amplified phage in 1 ml SM 

Buffer 

10' 5 = 1 fil of the 10 3 dilution in 1 ml SM 

Buffer 

2. Add 200 /il host (in 10 mM MgSOJ to two 
tubes . 

3. Inoculate one with 10 /il 10" 6 dilution (10" 

5 ) . 

4 . Inoculate the other with l /il 10 6 dilution 

do- 6 ) . 

5. Incubate 37°C, 15 minutes. 

6. Add about 3 ml 48 °C top agar. 

[50 ml stock containing 150 /il IPTG 
(0.5M) and 375 ill X-GAL (350 mg/ml)] 

7. Plate on 100 mm plates and incubate 37°C, 

overnight . 

8. Results: 

gtll: 1. 7 x 10 u /ml 

ZAP II: 2.0 x 10 l0 /ml 
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Example 2 

Construction of a Sta b l e, Large Insert dna Lihr*^ 

P i c op 1 an k ton Genomi c DNA 

Cell collection and preparation of DNA. Agarose 
plugs containing concentrated picoplankton cells were 
prepared from samples collected on an oceanographic 
cruise from Newport, Oregon to Honolulu, Hawaii. 
Seawater (30 liters) was collected in Niskin bottles, 
screened through 10 pm Nitex, and concentrated by hollow 
fiber filtration (Amicon DC10) through 30,000 MW cutoff 
polyfulfone filters. The concentrated bacterioplankton 
cells were collected on a 0.22 pan, 47 mm Durapore filter, 
and resuspended in l ml of 2X STE buffer (im NaCl, 0.1M ' 
EDTA, 10 mM Tris, pH 8.0) to a final density of 
approximately l x 10'° cells per ml. The cell suspension 
was mixed with one volume of 1% molten Seaplaque LMP 
agarose (FMC) cooled to 40*C, and then immediately drawn 
into a 1 ml syringe. The syringe was sealed with 
parafilm and placed on ice for 10 min. The cell- 
containing agarose plug was extruded into 10 ml of Lysis 
Buffer (lOmM Tris pH 8.0, 50 mM NaCl, 0 . 1M EDTA, 1% 
Sarkosyl, 0.2% sodium deoxycholate, 1 mg/ml lysozyme) and 
incubated at 37°C for one hour. The agarose plug was 
then transferred to 40 mis of ESP Buffer (1% Sarkosyl, l 
mg/ml proteinase K, in 0.5M EDTA), and incubated at 55°C 
for 16 hours. The solution was decanted and replaced 
With fresh ESP Buffer, and incubated at 55°C for an 
additional hour. The agarose plugs were then placed in 
50 mM EDTA and stored at 4°C shipboard for the duration 
of the oceanographic cruise. 

One slice of an agarose plug (72 /il) prepared from a 
sample collected off the Oregon coast was dialyzed 
overnight at 4°C against 1 mL of buffer A (lOOmM NaCl, 
10mM Bis Tris Propane-HCl, 100 /zg/ml acetylated BSA: pH 
7.0 @ 25°C) in a 2 mL microcentrifuge tube. The solution 
was replaced with 250 ^1 of fresh buffer A containing 10 
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mM MgCl 2 and 1 mM DTT and incubated on a rocking platform 
for 1 hr at room temperature. The solution was then 
changed to 250 fil of the same buffer containing 4U of 
Sau3Al (NEB) , equilibrated to 37°C in a water bath, and 
then incubated on a rocking platform in a 37°C incubator 
for 4 5 min. The plug was transferred to a 1.5 ml 
microcentrifuge tube and incubated at 68°C for 30 min to 
inactivate the enzyme and to melt the agarose. The 
agarose was digested and the DNA dephosphorylased using 
Gelase and HK-phosphatase (Epicentre) , respectively, 
according to the manufacturer's recommendations. Protein 
was removed by gentle phenol /chloroform extraction and 
the DNA was ethanol precipitated, pelleted, and then 
washed with 70% ethanol. This partially digested DNA was 
resuspended in sterile H 2 0 to a concentration of 2 . 5 
ng/ptl for ligation to the pFOSl vector." 

PCR amplification results from several of the 
agarose plugs (data not shown) indicated the presence of 
significant amounts of archaeal DNA. Quantitative 
hybridization experiments using rRNA extracted from one 
sample, collected at 200 m of depth off the Oregon Coast, 
indicated that planktonic archaea in (this assemblage 
comprised approximately 4.7% of the total picoplankton 
biomass (this sample corresponds to "PACI"-200 m in Table 
l of DeLong et al . , high abundance of Archaea in 
Antarctic marine picoplankton, Nature, 371:695-698, 
1994) . Results from archaeal -biased rDNA PCR 
amplification performed on agarose plug lysates confirmed 
the presence of relatively large amounts of archaeal DNA 
in this sample. Agarose plugs prepared from this 
picoplankton sample were chosen for subsequent fosmid 
library preparation. Each 1 ml agarose plug from this 
site contained approximately 7.5 x 10 5 cells, therefore 
approximately 5.4 x 10 5 cells were present in the 72 fil 
slice used in the preparation of the partially digested 
DNA. 
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Vector arms were prepared from pFOSl as described 
(Kim et al., Stable propagation of casmid sized human DNA 
inserts in an F factor based vector, Nucl . Acids Res., 
20:10832-10835, 1992). Briefly, the plasmid was 
completely digested with Astll, dephosphqrylated with HK 
phosphatase, and then digested with BamHI to generate two 
arms, each of which contained a cos site in the proper 
orientation for cloning and packaging ligated DNA between 
35-45 kbp. The partially digested picoplankton DNA was 
ligated overnight to the PFOSl arms in a 15 /xl ligation 
reaction containing 25 ng each of vector and insert and 
1U of T4 DNA ligase (Boehringer-Mannheim) . The ligated 
DNA in four microliters of this reaction was in vitro 
packaged using the Gigapack XL packaging system 
(Stratagene) , the fosmid particles transf ected to E. coli 
strain DH10B (BRL) , and the cells spread onto LB „ 

** cm 15 

plates. The resultant fosmid clones were picked into 96- 
well microliter dishes containing LB crolJ supplemented with 
7% glycerol. Recombinant fosmids, each containing ca. 40 
kb of picoplankton DNA insert, yielded a library of 3.552 
fosmid clones, containing approximately 1.4 x 10* base 
pairs of cloned DNA. All of the clones examined 
contained inserts ranging from 38 to 42 kbp. This 
library was stored frozen at -80«C for later analysis. 

Example 3 

Hybridization Selecti on and Production of Expression 

Library 

Starting with a plasmid library prepared as 
described in Example 1, hybridization selection and 
preparation of the expression library were performed 
according to the protocol described in this example. The 
library can contain DNA from isolated microorganisms, 
enriched cultures or environmental samples. 



Single-stranded DNA is made in one of two ways: 1) 
The plasmid library can be grown and the double -stranded 
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plasmid DNA isolated. The double -stranded DNA is made 
single-stranded'using Fl gene II protein and Exonuclease 
III. The gene II protein nicks the double -stranded 
plasraids at the Fl . origin and the Exo III digests away 
the nicked strand leaving a single -stranded circle. This 
method is used by Life Technologies in their GeneTrapper™ 
kit; 2) the second method involves the use of a helper 
phage to "rescue" one of the strands of the double - 
stranded plasmids . The plasmid library is grown in a 
small overnight culture. A small aliquot of this is 
mixed with VCS-M13 helper phage and again grown 
overnight. The next morning the phagemids (virus 
particles containing single -stranded DNA) are recovered 
from the media and used in the following protocol. 

PROTOCOL 

1. Six samples of 4 /ig of rescued, single -stranded DNA - 
from library #17 were prepared in 3X SSC buffer. 
Final reaction volumes were 30 /zl. 

2. To these solutions was added one of the following: 

a) nothing 

b) 100 ng of biotinylated probe from an 
unrelated sequence 

c,d) 100 ng of biotinylated probe from organism 

#13 DNA polymerase gene 
e,f) 100 ng of biotinylated probe from organism 

#17 DNA polymerase gene 
Biotinylated probes were prepared by PCR 
amplification of fragments of -1300 bp in length 
coding for a portion of the DNA polymerase gene of 
these organisms. The amplification products were 
made using biotinylated dUTP in the amplification 
mix. This modified nucleotide is incorporated 
throughout the DNA during synthesis. Unincorporated 
nucleotides were removed using the QIAGEN PCR Clean- 
up kit . 
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3. These mixtures were denatured by heating to 95°c for 
2 minutes . 

4. Hybridization was performed for 90 minutes at 70°C 
for samples a, b, d and f . Samples c and e were 
hybridized at 60°C. 

5. 50 Ml of washed and blocked MPG beads were added and 
mixed to each sample. These mixtures were agitated 
every 5 minutes for a total of 30 minutes. MPG 
beads are sent at 1 mg/ml in buffer containing 
preservative so 6 sets of 100 fil were washed 2 times 
in 3X SSC and resuspended in 60 fil of 3X SSC 
containing 100 fig of sonicated salmon sperm DNA. 

6. The DNA/bead mixtures were washed 2 times at room 
temperature in 0.1X SSC/ 0.1% SDS, 2 times at 42°C in 
0.1X SSC/0.1% SDS for 10 minutes each and 1 
additional wash at room temperature with 3X SSC. 



7. 
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The bound DNA was eluted by heating the beads to 
70°C for 15 minutes in 50 /zl TE. 



8. Dilutions of the eluted DNAs were made and PCR 
amplification was performed with either gene 
specific primers or vectors specific primers. 
Dilutions of the library DNA were used as standards 

9. The DNA inserts contained within the DNA were 
amplified by PCR using vector specific primers. 
These inserts were cloned using the TA Cloning 
system (Invitrogen) . 



Duplicates of 92 white colonies and 4 blue colonies 
from samples d and f were grown overnight and colony 
lifts were prepared for Southern blotting. 
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11. The digoxigenin system from Boehringer Mannheim was 
used to probe the colonies using the organism #17 
probe . 

RESULTS 

PCR Quantitation 

Figures 1A and IB. Figure 1A is a photograph of the 
autoradiogram resulting from the Southern hybridization 
agarose gel electrophoresis columns of DNA from sample 
solutions a-f in Example 2, when hybridized with gene 
specific primers. Figure IB is a photograph of the 
autoradiogram resulting from the Southern hybridization 
agarose gel electrophoresis columns of DNA from sample 
solutions a-f in Example 2, when hybridized with vector 
specific primers. 

The gene specific DNA amplifications of samples a 
and b demonstrate that non-specific binding to the beads 
is minimal. The amount of DNA bound under the other 
conditions results in the following estimates of 
enrichment . 



gene specific equivalent 



total 



enrichme 
nt 



c 
d 
e 
f 



50 ng 
50 ng 
20 ng 
20 ng 



100 pg 
3 0 pg 
50 pg 
20 pg 



500X 
1667X 

400X 
1000X 



Colony Hybridization 

Figure 2 is a photograph of four colony 
hybridization plates resulting from Plates A and B show 
positive clones i.e., colonies containing sequences 
contained in the probe and which contain DNA from a 
library prepared in accordance with the invention. 
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Plates C and D were controls and showed no positive 
clones . 



Seven of 92 colonies from the panned sample were 
positive for sequences contained in the probe. No 
positive clones were found in the unpanned sample. 

Example 4 
Enzymatic Activity Assay 

The following is a representative example of a 
procedure for screening an expression library, prepared 
in accordance with Example l, for hydrolase activity. 

Plates of the library prepared as described in 
Example 1 are used to multiply inoculate a single plate 
containing 200 of LB Amp/Meth, glycerol in each well. 
This step is performed using the High Density Replicating 
Tool (HDRT) of the Beckman Biomek with a 1% bleach, 
water, isopropanol, air-dry sterilization cycle between 
each inoculation. The single plate is grown for 2h at 
37°C and is then used to inoculate two white 96 -well 
Dynatech microtiter daughter plates containing 250 /*L of 
LB Amp/Meth, glycerol in each well. The original single 
plate is incubated at 37°c for 18h, then stored at -80°C. 
The two condensed daughter plates are incubated at 3 7°C 
also for 18 h. The condensed daughter plates are then 
heated at 70°C for 45 min. to kill the cells and 
inactivate the host E. coli enzymes. A stock solution of 
5mg/mL morphourea phenylalanyl-7-amino-4-trif luoromethyl 
coumarin (MuPheAFC, the 'substrate') in DMSO is diluted 
to 600 pM with 50 mM pH 7.5 Hepes buffer containing 0.6 
mg/mL of the detergent dodecyl maltoside. 
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MuPheAFC 

Fifty fih of the 600 MuPheAFC solution is added to 
each of the wells of the white condensed plates with one 
100 /iL mix cycle using the Biomek to yield a final 
concentration of substrate of ~ 100 pM. The fluorescence 
values are recorded (excitation = 4 00 nm, emission = 505 
nm) on a plate reading fluorometer immediately after 
addition of the substrate <t=0) . The plate is incubated 
at 70°C for 100 min, then allowed to cool to ambient 
temperature for 15 additional minutes. The fluorescence 
values are recorded again (t=100) . The values at t = 0 are 
subtracted from the values at t=l00 to determine if an 
active clone is present. 

The data will indicate whether one of the clones in 
a particular well is hydrolyzing the substrate. In order 
to determine the individual clone which carries the 
activity, the source library plates are thawed and the 
individual clones are used to singly inoculate a new 
plate containing LB Amp/Meth, glycerol. As above, the 
plate is incubated at 37°C to grow the cells, heated at 
70°C to inactivate the host enzymes, and 50 /iL of 600 /iM 
MuPheAFC is added using the Biomek. 

After addition of the substrate the t=0 fluorescence 
values are recorded, the plate is incubated at 70°C, and 
the t=l00 min. values are recorded as above. These data 
indicate which plate the active clone is in. 

The enantioselectivity value, E, for the substrate 
is determined according to the equation below: 

ln[ (i-c(l+ee p ) ] 
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E= 

ln[ (l-c(l-ee p )] 
where ee,, = the enantiomeric excess (ee) of the 
hydrolyzed product and c = the percent conversion of the 
reaction. See Wong and Whitesides, Enzymes in Synthetic 
Organic Chemistry, 1994, Elsevier, Tarrytown, New York 
pp. 9-12. 

The enantiomeric excess is determined by either 
chiral high performance liquid chromatography (HPLC) or 
chiral capillary electrophoresis (CE) . Assays are 
performed as follows : two hundred M L of the appropriate 
buffer is added to each well of a 96 -well white 
microtiter plate, followed by 50 fih of partially or 
completely purified enzyme solution,- 50 fxL of substrate 
is added and the increase in fluorescence monitored 
versus time until 50% of the substrate is consumed or the 
reaction stops, whichever comes first. 

Example 5 

Mutagenesis of Positi™ T 2 nzvme Ae n v itv cion** 
Mutagenesis was performed on two different enzymes 
(alkaline phosphatase and 6-glycosidase) , using the two 
different strategies described here, to generate new 
enzymes which exhibit a higher degree of activity than 
the wild-type enzymes. 

Alkaline Phosp hataao 

The XLl-Red strain (Stratagene) was transformed with 
genomic clone 27a3a (in plasmid pBluescript) encoding the 
alkaline phosphatase gene from the organism 0C9a 
according to the manufacturer's protocol. A 5ml culture 
of lb + o.i mg/ml ampicillin was inoculated with 200^1 of 
the transformation. The culture was allowed to grow at 
37°C for 30 hours. A miniprep was then performed on the 
culture, and screening was performed by transforming 2 M 1 
of the resulting DNA into XL-l Blue cells (Stratagene) 
according to the manufacturer's protocol and following 
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procedure outlined below (after "Transform XL1 Blue 
cells) . The mutated OC9a phosphatase took 10 minutes to 
develop color and the wild type enzyme took 30 minutes to 
develop color in the screening assay. 

Standard Alkaline Phosphatase Screening Assay 

Transform XL1 Red strain -> Inoculate 5ml LB/amp 
culture with 200//1 transformation and incubate at 37°C 
for 30 hours -» Miniprep DNA -» Transform XL1 Blue cells -* 
Plate on LB/amp platesr* Lift colonies with Duralon UV 
(Stratagene) or HATF (Millipore) membranes -* Lyse in 
chloroform vapors for 30 seconds -> Heat kill for 30 
minutes at 85°C Develop filter at room temperature in 
BCIP buffer -> Watch as filter develops and identify and 
pick fastest developing colonies ("positives") -» Restreak 
"positives" onto a BCT-P plate 

BCIP Buffer: 

20mm CAPS pH 9 . Of* 
1mm MgCl? ; -* r 
0.01 mm ZnCl 2 
0.1 mg/ml BCIP 

Beta-Glvcosidase 

This protocol was used to mutagenize Thermococcus 
9N2 Beta-Glycosidase. 
PCR Reaction 

2 microliters dNTP's (lOmM Stocks) 
10 microliters lOxPCR Buffer 

.5 microliters Vector DNA-31G1A-100 nanograms 

20 microliters 3' Primer (100 pmol) 

2 0 microliters 5' Primer (100 pmol) 

16 microliters MnCl 4H 2 0 (1.25mM Stock) 

24.5 microliters H 2 0 

1 microliter Tag Polymerase (5.0 Units) 
100 microliters total 
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Reaction Cycle 

95 °C 15 seconds 
58 °C 30 seconds 
72 °C 90 seconds 

25 cycles (10 minute extension at 72°C-4°c 
incubation) 



Run 5 microliters on a 1% agarose gel to check 
the reaction. 

Purify on a Qiaquick column (Qiagen) . 
Resuspend in 50 microliters H,0. 

Restriction Digest 

25 microliters purified PCR product 
10 microliters NEB Buffer #2 
3 microliters Kpn I (lOU/microliter) 
3 microliters EcoRl (20U/microliter) 
59 microliters H-.0 

Cut for 2 hours at 37°C. 

Purify on a Qiaquick column (Qiagen) . 

Elute with 35 microliters H,0. 

Ligation 

10 microliters Digested PCR product 

5 microliters Vector (cut with EcoRi/Kpnl and 

phosphatased with shrimp alkaline phosphatase 

4 microliters 5x Ligation Buffer 

1 microliter T4 DNA Ligase (BRL) 

Ligate overnight . 



Transform into M15pREP4 cells using 
electroporation. 



Plate 100 or 200 microliters onto LB amp meth 
kan plates, grow overnight at 37 degrees 
celsius . 
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Beta-Glvcosidase Assay 

Perform glycosidase assay to screen for mutants as 
follows. The filter assay uses buffer Z (see recipe 
below) containing 1 mg/ml of the substrate 5-bromo-4- 
chloro- 3 - indolyl - 6 -o-glucopyranoside (XGLU) (Diagnostic 
Chemicals Limited or Sigma) . 

Z-Buffer: (referenced in Miller, J.H. (1992) A 
Short Course in Bacterial Genetics, p. 445.) 
per liter: 

Na,HP0 4 -7H 2 0 16.1 g 

NaH 2 P0 4 -H 2 0 5.5 g 

KC1 0.75. g 

MgS0 4 -7H 3 0 0.246 g 

6-mercaptoethanol 2 . 7 ml 

Adjust pH to 7.0 

(1) Perform colony lifts using Millipore HATF 
membrane filters. 

(2) Lyse colonies with chloroform vapor in 150 mm 
glass petri dishes. 

(3) Transfer filters to 100 mm glass petri dishes 
containing a piece of Whatman 3 MM filter paper saturated 
with Z buffer containing 1 mg/ml XGLU. After 
transferring filter bearing lysed colonies to the glass 
petri dish, maintain dish at room temperature. 

(4) "Positives" were observed as blue spots on the 
filter membranes ("positives" are spots which appear 
early) . Use the following filter rescue technique to 
retrieve plasmid from lysed positive colony. Use pasteur 
pipette (or glass capillary tube) to core blue spots on 
the filter membrane. Place the small filter disk in an 
Epp tube containing 20 fil water. Incubate the Epp tube 
at 75°C for 5 minutes followed by vortexing to elute 
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Transform this DNA into 
cells. Repeat filter-lift assay 
to identify "positives." Return 
37°C incubator after filter lift 
Inoculate 3 ml LBamp liquid with 
incubate at 37°C overnight, 
these cultures and sequence 



Example 7 

Directed Mutagenesis of Posi t ive En 2y m * Activity Hnr, aa 

Directed mutagenesis was performed on two different 
enzymes (alkaline phosphatase and 6-glycosidase) , using 
the two different strategies described here, to generate 
new enzymes which exhibit a higher degree of activity at 
lower temperatures than the wild-type enzymes. 

Alkaline Phosphatase 

The XLl-Red strain (Stratagene) was transformed with 
DNA encoding an alkaline phosphatase (in plasmid 
pBluescript) from the organism OC9a according to the 
manufacturer's protocol. A 5ml culture of LB + 0.1 mg/ml 
ampicillin was inoculated with 200/il of the 
transformation. The culture was allowed to grow at 37°C 
for 30 hours. A miniprep was then performed on the 
culture, and screening was performed by transforming 2/xl 
of the resulting DNA into XL-l Blue cells (Stratagene) 
according to the manufacturer's protocol. 

Standard Alkaline Phosphata se Screening Assay 

-» Plate on LB/amp plates Lift colonies with 
Duralon UV (Stratagene) or HATF (Millipore) membranes -* 
Lyse in chloroform vapors for 30 seconds -» Heat kill for 
30 minutes at 85»C -* Develop filter at room temperature 
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in BCIP buffer -» Watch as filter develops and identify 
and pick fastest developing colonies ("positives") -» 
Restreak "positives" onto a BCIP plate. 



BCIP Buffer; 

20mm CAPS pH 9.0 

1mm MgCl 2 

0 . 01 mm ZnCl 2 

0.1 mg/ml BCIP 
The mutated OC9a phosphatase took 10 minutes to develop 
color and the wild type enzyme took 30 minutes to develop 
color in the screening assay. 

Beta-Glvcosidase 

This protocol was used to mutagenize DNA encoding 
Thermococcus 9N2 Beta-Glycosidase . This DNA sequence is 
set forth in Figure 1. 

PCR 

2 microliters dNTP's (lOmM Stocks) 
10 microliters lOxPCR Buffer 
.5 microliters pBluescript vector containing 
Beta-glycosidase DNA (100 nanograms) 

20 microliters 3' Primer (100 pmol) 
2 0 microliters 5' Primer (100 pmol) 
16 microliters MnCl 4H 2 0 (1.25mM Stock) 
24 . 5 microliters H 2 0 

1 microliter Tag Polymerase (5.0 Units) 
100 microliters total 



Reaction Cycle 

95°C 15 seconds 
58° C 30 seconds 
72°C 90 seconds 

25 cycles (10 minute extension at 72°C-4°C 
incubation) 

Run 5 microliters on a 1% agarose gel to check 
the reaction. 



-57- 

e*i in n ti n rrr t*ucer mi n E nc\ 



WO 97/20918 

PCT/US96/194S7 

Purify on a Qiaquick column (Qiagen) . 
Resuspend in 50 microliters H 2 0. 

Restriction Dig est: 

25 microliters purified PCR product 
10 microliters NEB Buffer #2 
3 microliters Kpn I (lOU/microliter) 
3 microliters EcoRl (20U/microliter) 
59 microliters H,0 

Cut for 2 hours at 37°C. 

Purify on a Qiaquick column (Qiagen) . 

Elute with 35 microliters H,0 . 

Ligation 

10 microliters Digested PCR product 

5 microliters pBluescript Vector (cut with 

EcoRI/Kpnl and phosphatased with shrimp 

alkaline phosphatase) 

4 microliters 5x Ligation Buffer 

1 microliter T4 DNA Ligase (BRL) 

Ligate overnight. 

Transform into Ml5pREP4 cells using 
electroporation . 

Plate 100 or 200 microliters onto LB amp meth 
kan plates, grow overnight at 37 degrees 
Celsius. 



Beta-Glycosidase Assay 

Perform glycosidase assay to screen for mutants as 
follows. The filter assay uses buffer Z (see recipe 
below) containing l mg/ml of the substrate 5-bromo-4- 

chloro-3-indolyl-6-o-glucopyranoside (XGLU) (Diagnostic 
Chemicals Limited or Sigma) . 
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Z-Buffer: (referenced in Miller, J.H. (1992) A 
Short Course in Bacterial Genetics, p. 445.) 
per liter: 

Na 2 HP0 4 -7H 2 0 16.1 g 

NaH>P0 4 -H 2 0 5.5 g 

KC1 0.75 g 

MgS0 4 -7H,0 0.246 g 

6-mercaptoethanol 2 . 7 ml 
Adjust pH to 7.0 

(1) Perform colony lifts using Millipore HATF 
membrane filters. 

(2) Lyse colonies with chloroform vapor in 150 mm 
glass petri dishes. 

(3) Transfer filters to 100 mm glass petri dishes 
containing a piece of Whatman 3 MM filter paper saturated 
with Z buffer containing 1 mg/ml XGLU. After 
transferring filter bearing lysed colonies to the glass 
petri dish, maintain dish at room temperature. 

(4) "Positives" were observed as blue spots on the 
filter membranes ("positives" are spots which appear 
early) . Use the following filter rescue technique to 
retrieve plasmid from lysed positive colony. Use pasteur 
pipette (or glass capillary tube) to core blue spots on 
the filter membrane. Place the small filter disk in an 
Epp tube containing 20 jil water. Incubate the Epp tube 
at 75°C for 5 minutes followed by vortexing to elute 
plasmid DNA off filter. Transform this DNA into 
electrocompetent E. coli cells. Repeat filter-lift assay 
on transformation plates to identify "positives." Return 
transformation plates to 37°C incubator after filter lift 
to regenerate colonies. Inoculate 3 ml LBamp liquid with 
repurified positives and incubate at 37°C overnight. 
Isolate plasmid DNA from these cultures and sequence 
plasmid insert . 
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The /3-glycosidase subjected to mutagenesis acted on 
XGLU 2.5 times more efficiently than wild-type 0- 
glycosidase . 

Numerous modifications and variations of the present 
invention are possible in light of the. above teachings; 
therefore, within the scope of the claims, the invention 
may be practiced other than as particularly described. 
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What Is Claimed Is: 

1. A process for identifying clones having a 
specified enzyme activity, which process comprises: 

screening for said specified enzyme activity in a 
library of clones prepared by 

* (i) selecting and recovering target nucleic 
acid from a nucleic acid population derived from at least 
two microorganisms, by use of at least one nucleic acid 
or nucleic acid like hybridizing probe comprising at 
least a portion of a nucleic acid sequence encoding a 
protein having a specified activity; and 

(ii) transforming a host with recovered target 
nucleic acid to produce a library of clones comprised of 
a subset of the original nucleic acid population which 
are screened for a specified activity. 

2 . The process of claim 1 wherein the nucleic acid 

* 

obtained from the nucleic acid of the microorganism is 
selected by: 

converting double stranded nucleic acid into single 
stranded nucleic acid; 

recovering from the converted single stranded 
nucleic acid target single stranded nucleic acid which 
hybridizes to probe nucleic acid; and 

converting recovered single stranded target nucleic 
acid to double stranded nucleic acid for transforming the 
host . 

3. The process of claim 2 wherein the probe is 
directly or indirectly bound to a solid phase by which it 
is separated from single stranded nucleic acid which is 
not so hybridized. 

4. The process of claim 3 which further comprises 

the steps of : 

releasing single stranded nucleic acid from said 

probe ; and 
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amplifying £he single stranded nucleic acid so 
released prior to converting it to double stranded 
nucleic acid. 



5. The process of Claim l wherein said target 
nucleic acid encodes a gene cluster or portion thereof. 

6. A process in which a nucleic acid library 
derived from at least one microorganism is subjected to t 
selection procedure to select therefrom double -stranded 
nucleic acid which hybridizes to one or more probe 
nucleic acid sequences which is all or a portion of a 
nucleic acid sequence encoding a protein having the 
specified protein activity, which process comprises: 

(a) contacting the double -stranded nucleic acid 
population with the nucleic acid probe bound to a ligand 
under conditions permissive of hybridization so as to 
produce a complex of probe and members of the nucleic 
acid population which hybridize thereto; 

(b) contacting the complex of (a) with a solid 
phase specific binding partner for said ligand so as to 
produce a solid phase complex; 

(c) separating the solid phase complex from the 
unbound nucleic acid population of (b) ,- 

(d) releasing from the probe the members of the 
population which had bound to the solid phase bound 
probe ; 

(e) introducing the double -stranded nucleic acid of 
(d) into a suitable host to form a library containing a 

plurality of clones containing the selected nucleic acid; 
and 

(f ) screening the library for the specified protein 
activity. 

7. The process of Claim 6 wherein said target 
nucleic acid encodes a gene cluster or portion thereof. 
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8. The process of Claim 6 which further comprises, 
before (a) above, the steps of: 

(i) . contacting the double -stranded nucleic acid 
population of (a) with a ligand-bound oligonucleotide 
probe that is complementary to consensus sequences in (i) 
unique to a given class of proteins under conditions 
permissive of hybridization to form a double -stranded 
complex; 

(ii) . contacting the complex of (i) with a solid 
phase specific binding partner for said ligand so as to 
produce a solid phase complex; 

(Hi) separating the solid phase complex from the 
unbound nucleic acid population; 

(iv) releasing the members of the population which 
had bound to said solid phase bound probe; and 

(v) separating the solid phase bound probe from the 
members of the population which had bound thereto. 

9. A process for obtaining a protein having a 
specified protein activity derived from a heterogeneous 
nucleic acid population, which process comprises: 

screening, for the specified enzyme activity, a 
library of clones containing nucleic acid from the 
heterogeneous nucleic acid population which have been 
modified or mutagenized towards production of the 
specified activity. 

10. The process of claim 9 which further comprises, 
prior to said mutagenesis, selectively recovering from 
the heterogeneous nucleic acid population nucleic acid 
which comprises nucleic acid sequences coding for a 
common characteristic, which can be the same or different 
from the specified activity. 

11. The process of claim 10 wherein recovering the 
nucleic acid preparation comprises contacting the nucleic 
acid population with a specific binding partner for at 
least a portion of the sequences. 
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12. The process of claim n wherein the specific 
binding partner is a solid phase bound hybridization 
probe . 



13 . The process of claim 10 wherein the common 
characteristic is a class of enzyme activity. 

14. The process of claim 13 which comprises 
recovering nucleic acid from clones containing nucleic 
acid from the heterogeneous nucleic acid population which 
exhibit the class of enzyme activity. 

15. The process of claim 9 wherein the mutagenesis 
is site-specific or random directed mutagenesis. 

16. The process of claim 9 which further comprises 
prescreening said library of clones for an activity, 
which can be the same or different from the specified 
activity, prior to exposing them to mutagenesis. 

17. The process of claim 16 which comprises 
prescreening said clones for the expression of a protein 
of interest . 

18 . A process for obtaining a protein having a 
specified protein activity, which process comprises: 

screening, for the specified activity, a library of 
clones containing nucleic acid from a pool of nucleic 
acid populations which have been exposed to mutagenesis 
in an attempt to produce in the library of clones nucleic 
acid encoding a protein having one or more desired 
characteristics which can be the same or different from 
the specified protein activity. 

19. The process of claim 18 which further 
comprises, prior to said mutagenesis, selectively 
recovering from the heterogeneous nucleic acid population 
nucleic acid which comprises nucleic acid sequences 
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coding for at least one common protein characteristic, 
which can be the same or different from the specified 
protein activity. 

20. The process of claim 19 wherein the at least 
one common characteristic is at least one common enzyme 
activity. 

21. The process of claim 20 wherein recovering the 
nucleic acid preparation comprises contacting the nucleic 
acid population with a specific binding partner for at 
least a portion of the coding sequences. 

22. The process of claim 21 wherein the specific 
binding partner is a solid phase bound hybridization 
probe . 

23. The process of claim 22 which comprises 
recovering nucleic acid from clones containing nucleic 
acid from the heterogeneous nucleic acid population which 
exhibit the class of enzyme activity. 

24. The process of claim 18 wherein the mutagenesis 
is site-specific or random mutagenesis. 

25. A process for providing a thermostable enzyme 
having improved enzyme activities at lower temperatures, 
said enzyme being a member selected from the group 
consisting of an enzyme or a polynucleotide encoding said 

enzyme comprising: 

(a) subjecting to mutagenesis at least one enzyme 
which is stable at a temperature of at least 60°C; and 

(b) screening mutants produced in (a) for a mutated 
enzyme or polynucleotide encoding a mutated enzyme, which 
mutated enzyme is stable at a temperature of at least 
60°C and which has an enzyme activity at a temperature at 
least 10° C below its optimal temperature range and which 
has activity greater than the enzyme of step (a) . 
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26. A process in which a nucleic acid library 
derived from at least one microorganism is subjected to a 
selection procedure to select therefrom double -stranded 
nucleic acid which hybridizes to one or more probe 
nucleic acid sequences which is all or a portion of a 
nucleic acid sequence encoding a protein having the 
specified protein activity, which process comprises: 

(a) contacting the double -stranded nucleic acid 
population with the nucleic acid probe bound to a ligand 
under conditions permissive of hybridization so as to 
produce a complex of probe and members of the nucleic 
acid population which hybridize thereto; 

(b) separating the complexes from the unbound 
nucleic acid population; 

(c) releasing from the probe the members of the 
population which had bound to the probe 

(d) introducing the double -stranded nucleic acid of 
(c) into a suitable host to form a library containing a 

plurality of clones containing the selected nucleic acid; 
and 

(e) screening the library for the specified protein 
activity. 
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10 ng (lib 17) 
1 ng 

100 pg p I G. I A 
10 PO 

a 
b 

c Gene specific primers 

d (1:100 dilution) 
e 
f 

standards 



1 pg (lib 17) 
100 fg 

1 tg F I G.I B 

a 
b 

jj Vector specific primers 
e (1:1000 dilution) 
f 

standards 
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2/1 

FI G. 3A 



Match with FIG. 3B 



1 ATG CTA CCA GAA GGC TTT CTC TGG GGC GTG 
1 Met Leu Pro GIu Gly Phe Leu Trp Gly Val 

61 GAC AAG CTC AGG AGG AAC ATT GAT CCG AAC 
21 Asp Lys Leu Arg Arg Asn lie Asp Pro Asn 

121 TTC AAC ATA AAG AGG GAA CTC GTC AGC GGC 
41 Phe Asn lie Lys Arg Glu Leu Val Ser Gly 

181 GAA CTT TAC GAG AAG GAT CAC CCC CTC GCC 

61 Glu Leu Tyr Glu Lys Asp His Pro Leu Ala 

241 GGA ATA GAG TGG AGC AGG ATC TTT CCC TGG 
81 Gly lie Glu Trp Ser Arg He Phe Pro Trp 

301 CGG GAC AGC TAC GGA CTC GTG AAG GAC GTC 
101 Arg Asp Ser Tyr Gly Leu Val Lys Asp Val 

361 GAC GAG ATA GCG AAT CAT C AG GAG ATA GCC 

121 Asp Glu He Ala Asn His Gin Glu He Ala 

421 GAG CTG GGC TTC AAG GTC ATC GTG AAC CTC 
141 Glu Leu Gly Phe Lys Val He Val Asn Leu 

481 GAT CCG ATA ATC GCG AGG GAG AAG GCC CTC 
161 Asp Pro He He Ala Arg Glu Lys Ala Leu 

541 GAG AGC GTG GTG GAG TTC GCC AAG TAC GCG 

181 Glu Ser Val Val Glu Phe Ala Lys Tyr Ala 

Match with FIG. 3 c . c~ 
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pip 2> R 

J Match with FIG . 3 A ^ L> 

TCC CAG TCC GGC TTT CAG TTC GAG ATG GGC 60 

Ser Gin Ser Gly Phe Gin Phe Glu Met Gly 20 

ACA GAC TGG TGG AAG TGG GTC AGG GAT CCC 120 
Thr Asp Trp Trp Lys Trp Val Arg Asp Pro 40 

GAC CTG CCC GAG GAG GGG ATA AAC AAC TAC 
Asp Leu Pro Glu Glu G)y lie Asn Asn Tyr 

AGA GAC CTC GGT CTG AAC GTT TAC AGG ATT 240 

Arg Asp Leu Gly Leu Asn Val Tyr Arg lie 80 

CCA ACG TGG TTT GTG GAG GTT GAC GTT GAA 300 
Pro Thr Trp Phe Val Glu Val Asp Val Glu 100 



AAA ATC GAT AAA 



180 
60 



360 



Lys lie Asp Lys Asp Thr Leu Glu Glu Leu 120 

TAC TAC CGC CGC GTT ATA GAG CAC CTC AGG 420 

Tyr Tyr Arg Arg Val lie Glu His Leu Arg 140 

AAC CAC TTC ACG CTC CCC CTC TGG CTT CAC 480 

Asn His Phe Thr Leu Pro Leu Trp Leu His 160 

ACC AAC GGT AGG ATT GGC TGG GTC GGG CAG 540 

Thr Asn Gly Arg He Gly Trp Val Gly Gin 180 

GCG TAC ATC GCG AAC GCA CTC GGG GAC CTC 600 

Ala Tyr He Ala Asn Ala Leu Gly Asp Leu 200 

MATCH WITH FIG. 3D 
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Match with FIG. 3 A 4 1 7 

FIG. 3C 



Match with FIG. 3D 



601 GTT GAT ATG TGG AGC ACC TTC AAC GAR CCG 

201 Val Asp Met Trp Ser Thr Phe Asn Glu Pro 
661 CCC TAC TCC GGY TTT CCN CCG GGG GTT ATG 

221 Pro Tyr Ser Gly Phe Pro Pro Gly Val Met 

♦ 

721 AAC ATG ATA AAC GCC CAC GCA CTG GCC TAC 
241 Asn Met He Asn Ala His Ala Leu Ala Tyr 

781 GCC GAT AAG GAT TCC CGC TCC GAG GCC GAG 
261 Ala Asp Lys Asp Ser Arg Ser Glu Ala Glu 

841 NCC TAT CCA NAC GAC TCC AAC GAC CCN AAG 
281 Xxx Tyr Pro Xxx Asp Ser Asn Asp Pro Lys 

901 TTC CAC AGC GGG CTC TTC TTC GAC GCA ATC 

301 Phe His Ser Gly Leu Phe Phe Asp Ala lie 

961 GGT GAG ACC TTC GTC AAA GTT CGG CAT CTC 
321 Gly Glu Thr Phe Val Lys Val Arg His Leu 

1021 TAC ACG AGA GAA GTC GTC AGG TAT TCG GAG 
341 Tyr Thr Arg Glu Val Val Arg Tyr Ser Glu 

1081 TTC CGG GGA GTT CAC AAC TAC GGT TAC GCC 
361 Phe Arg Gly Val His Asn Tyr Gly Tyr Ala 

1141 AGG CCC GTA AGC GAC ATC GGC TGG GAG ATC 

381 Arg Pro Val Ser Asp He Gly Trp Glu He 

Match with FIG. 3E 
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5 1 7 Match with FIG. 3B 

Fl G. 3D 

Match with FIG. 3C 

ATG GTC GTT GTG GAN CTC GGT TAC CTC GCG 660 

Met Val Val Val Xxx Leu Gly Tyr Leu Ala 220 

AAC CCC GAG GCG GMN AAN CTG GCA ATC CTC 720 

Asn Pro Glu Ala Xxx Xxx Leu Ala lie Leu 240 

AAG ATG ATA AAG AAG TTC GAC AGG GTA AAG 780 

Lys Met lie Lys Lys Phe Asp Arg Val Lys 260 

GTC GGG ATA ATC TAC AAC AAC ATA GGC GTT 840 

Val Gly He He Tyr Asn Asn He Gly Val 280 

GAC GTG AAA NCT NCA GAA AAC GAC AAC TAC 900 

Asp Val Lys Xxx Xxx Glu Asn Asp Asn Tyr 300 

CAC AAG GGC AAG CTC AAC ATC GAG TTC GAC 960 

His Lys Gly Lys Leu Asn He Glu Phe Asp 320 

AGG GGG AAC GAC TGG ATA GGC. GTT AAC TAC 1020 

Arg Gly Asn Asp Trp He Gly Val Asn Tyr 340 

* 

CCC AAG TTC CCG AGC ATA CCC CTG ATA TCC 1080 

Pro Lys Phe Pro Ser He Pro Leu He Ser 360 

TGC AGG CCC GGG AGT TCT TCC GCC GAC GGA 1140 

Cys Arg Pro Gly Ser Ser Ser Ala Asp Gly 380 

TAT CCG GAG GGG ATC TAC GAC TCG ATA AGA 1200 

Tyr Pro Glu Gly He Tyr Asp Ser He Arg 400 
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MATCH WITH FIG. 3C 

Fl G. 3 E 



6/7 



MATCH WITH FIG. 3F 



1201 GAG GCC AAC AAA TAC GGG. GTC CCG GTT TAC 

401 Glu Ala Asn Lys Tyr Gly Val Pro Val Tyr 

1261 GAC ACC CTG CGG CCG TAC TAC CTC GCG AGC 

421 Asp Thr Leu Arg Pro Tyr Tyr Leu Ala Ser 

1321 GCG GGT TAC GAC GTC AGG GGC TAC CTC TAC 

441 Ala Gly Tyr Asp Val Arg Gly Tyr Leu Tyr 

1381 CTC GGT TTC AGG ATG AGG TTC GGC CTC TAT 

461 Leu Gly Phe Arg Met Arg Phe Gly Leu Tyr 

1441 CCG CGG GAG GAA AGC GTA AAG GTT TAT AGG 

481 Pro Arg Glu Glu Ser Val Lys Val Tyr Arg 

1501 GAA ATC CGG GAG AAG TTC GGA CTT GGG TGA 

501 Glu lie Arg Glu Lys Phe Gly Leu Gly End 



F 1 G . 3A 


F 1 G. 3B 


FIG. 3C 


FIG. 3D 


FIG. 3E 


FIG.3F 



Fl G. 3 
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7 / 7 



Match with FIG. 3E 

GTC ACC GAA AAC GGA ATA 
Val Thr Glu Asn Gly He 

CAT GTA GCG AAG ATT GAG 

His Val Ala Lys He Glu 

TGG GCG CTG ACC GAC AAC 
Trp Ala Leu Thr Asp Asn 

AAA GTG GAT CTC ATA ACC 

Lys Val Asp Leu He Thr 

GGC ATC GTG GAG AAC AAC 

Gly He Val Glu Asn Asn 

1530 
510 



Match with FIG. 3D 

Fl G. 3F 



GCC GAT TCA ACT 1260 

Ala Asp Ser Thr 420 

GAG GCG TAC GAG 1320 

Glu Ala Tvr Glu 440 

TAC GAG TGG GCC 1380 

Tyr Glu Trp Ala 460 

AAG GAG AGA ACA 1440 

Lys Glu Arg Thr 480 

GGA GTG AGC AAG 1500 

Gly Val Ser Lys 500 
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